Dna diagnostics based on mass spectrometry

ABSTRACT

Fast and highly accurate mass spectrometry-based processes for detecting a particular nucleic acid sequence in a biological sample are provided. Depending on the sequence to be detected, the processes can be used, for example, to diagnose a genetic disease or chromosomal abnormality; a predisposition to a disease or condition, infection by a pathogenic organism, or for determining identity or heredity.

RELATED APPLICATIONS

[0001] This application is a continuation of International PCT application No. PCT/US97/20444, filed Nov. 6, 1997. This application is also a continuation-in-part of U.S. application Ser. No. 08/744,481, filed Nov. 6, 1996, to Koster, entitled “DNA DIAGNOSTICS BASED ON MASS SPECTROMETRY”. This application is also a continuation-in-part of U.S. application Ser. Nos. 08/744,590, 08/746,036, 08/746,055, 08/786,988, 08/787,639, 08/933,792 and U.S. application Ser. No. 08/947,801, filed Oct. 8, 1997, which is a continuation-in-part of U.S. application Ser. Nos. 08/746,055, 08/786,988 and 08/787,639. Benefit of priority is claimed to each of these applications.

[0002] This application is related to U.S. patent application Ser. No. 08/617,256 filed on Mar. 18, 1996, which is a continuation-in-part of U.S. application Ser. No. 08/406,199, filed Mar. 17, 1995, now U.S. Pat. No. 5,605,798, and is also related U.S. Pat. Nos. 5,547,835 and 5,622,824.

[0003] The subject matter of each of the above-noted patent applications and the patent is herein incorporated in its entirety.

BACKGROUND OF THE INVENTION

[0004] 1. Detection of mutations

[0005] The genetic information of all living organisms e.g., animals, plants and microorganisms) is encoded in deoxyribonucleic acid (DNA). In humans, the complete genome is contains of about 100,000 genes located on 24 chromosomes (The Human Genome, T. Strachan, BIOS Scientific Publishers, 1992). Each gene codes for a specific protein, which after its expression via transcription and translation, fulfills a specific biochemical function within a living cell. Changes in a DNA sequence are known as mutations and can result in proteins with altered or in some cases even lost biochemical activities; this in turn can cause genetic disease. Mutations include nucleotide deletions, insertions or alterations (i.e. point mutations). Point mutations can be either “missense”, resulting in a change in the amino acid sequence of a protein or “nonsense” coding for a stop codon and thereby leading to a truncated protein.

[0006] More than 3000 genetic diseases are currently known (Human Genome Mutations, D. N. Cooper and M. Kraiwczak, BIOS Publishers, 1993), including hemophilias, thalassemias, Duchenne Muscular Dystrophy (DMD), Huntington's Disease (HD), Alzheimer's Disease and Cystic Fibrosis (CF). In addition to mutated genes, which result in genetic disease, certain birth defects are the result of chromosomal abnormalities such as Trisomy 21 (Down's Syndrome), Trisomy 13 (Patau Syndrome), Trisomy 18 (Edward's Syndrome), Monosomy X (Turner's Syndrome) and other sex chromosome aneuploidies such as Klienfelter's Syndrome (XXY). Further, there is growing evidence that certain DNA sequences may predispose an individual to any of a number of diseases such as diabetes, arteriosclerosis, obesity, various autoimmune diseases and cancer (e.g., colorectal, breast, ovarian, lung).

[0007] Viruses, bacteria, fungi and other infectious organisms contain distinct nucleic acid sequences, which are different from the sequences contained in the host cell. Therefore, infectious organisms can also be detected and identified based on their specific DNA sequences.

[0008] Since the sequence of about 1 6 nucleotides is specific on statistical grounds even for the size of the human genome, relatively short nucleic acid sequences can be used to detect normal and defective genes in higher organisms and to detect infectious microorganisms (e.g., bacteria, fungi, protists and yeast) and viruses. DNA sequences can even serve as a fingerprint for detection of different individuals within the same species (see, Thompson, J. S. and M. W. Thompson, eds., Genetics in Medicine, W. B. Saunders Co., Philadelphia, Pa. (1991)).

[0009] Several methods for detecting DNA are currently being used. For example, nucleic acid sequences can be identified by comparing the mobility of an amplified nucleic acid fragment with a known standard by gel electrophoresis, or by hybridization with a probe, which is complementary to the sequence to be identified. Identification, however, can only be accomplished if the nucleic acid fragment is labeled with a sensitive reporter function (e.g., radioactive (³²P, ³⁵S), fluorescent or chemiluminescent). Radioactive labels can be hazardous and the signals they produce decay over time. Non-isotopic labels (e.g., fluorescent) suffer from a lack of sensitivity and fading of the signal when high intensity lasers are being used. Additionally, performing labeling, electrophoresis and subsequent detection are laborious, time-consuming and error-prone procedures. Electrophoresis is particularly error-prone, since the size or the molecular weight of the nucleic acid cannot be directly correlated to the mobility in the gel matrix. It is known that sequence specific effects, secondary structure and interactions with the gel matrix are causing artefacts.

[0010] 2. Use of mass spectrometry for detection and identification of nucleic acids

[0011] Mass spectrometry provides a means of “weighing” individual molecules by ionizing the molecules in vaccuo and making them “fly” by volatilization. Under the influence of combinations of electric and magnetic fields, the ions follow trajectories depending on their individual mass (m) and charge (z). In the range of molecules with low molecular weight, mass spectrometry has long been part of the routine physical-organic repertoire for analysis and characterization of organic molecules by the determination of the mass of the parent molecular ion. In addition, by arranging collisions of this parent molecular ion with other particles (e.g., argon atoms), the molecular ion is fragmented forming secondary ions by the so-called collision induced dissociation (CID). The fragmentation pattern/pathway very often allows the derivation of detailed structural information. Many applications of mass spectrometric methods are known in the art, particularly in biosciences (see, e.g., Methods in Enzymol., Vol. 193: “Mass Spectrometry” (J. A. McCloskey, editor), 1990, Academic Press, New York).

[0012] Because of the apparent analytical advantages of mass spectrometry in providing high detection sensitivity, accuracy of mass measurements, detailed structural information by CID in conjunction with an MS/MS configuration and speed, as well as on-line data transfer to a computer, there has been interest in the use of mass spectrometry for the structural analysis of nucleic acids. Recent reviews summarizing this field include K. H. Schram, “Mass Spectrometry of Nucleic Acid Components, Biomedical Applications of Mass Spectrometry” 34, 203-287 (1990); and P. F. Crain, “Mass Spectrometric Techniques in Nucleic Acid Research,” Mass Spectrometry Reviews 9, 505-554 (1990); see, also U.S. Pat. No. 5,547,835 and U.S. Pat. No. 5,622,824).

[0013] Nucleic acids, however, are very polar biopolymers that are very difficult to volatilize. Consequently, mass spectrometric detection has been limited to low molecular weight synthetic oligonucleotides for confirming an already known oligonucleotide sequence by determining the mass of the parent molecular ion, or alternatively, confirming a known sequence through the generation of secondary ions (fragment ions) via CID in an MS/MS configuration using, in particular, for the ionization and volatilization, the method of fast atomic bombardment (FAB mass spectrometry) or plasma desorption (PD mass spectrometry). As an example, the application of FAB to the analysis of protected dimeric blocks for chemical synthesis of oligodeoxynucleotides has been described (Köster et al. (1987) Biomed. Environ. Mass Spectrometry 14, 111-116).

[0014] Other ionization/desorption techniques include electrospray/ion-spray (ES) and matrix-assisted laser desorption/ionization (MALDI). ES mass spectrometry has been introduced by Fenn et al. (J. Phys. Chem. 88:4451-59 (1984); PCT Application No. WO 90/14148) and current applications are summarized in review articles (see, e.g., Smith et al. (1990) Anal. Chem. 62:882-89 and Ardrey (1992) Electrospray Mass Spectrometry, Spectroscopy Europe 4:10-18). The molecular weights of a tetradecanucleotide (see, Covey et al. (1 988) The “Determination of Protein, Oligonucleotide and Peptide Molecular Weights by Ionspray Mass Spectrometry,” Rapid Commun. in Mass Spectrometry 2:249-256), and of a 21-mer (Methods in Enzymol., 193, “Mass Spectrometry” (McCloskey, editor), p. 425, 1990, Academic Press, New York) have been published. As a mass analyzer, a quadrupole is most frequently used. Because of the presence of multiple ion peaks that all could be used for the mass calculation, the determination of molecular weights in femtomole amounts of sample is very accurate.

[0015] MALDI mass spectrometry, in contrast, can be attractive when a time-of-flight (TOF) configuration (see, Hillenkamp et al. (1990) pp 49-60 in “Matrix Assisted UV-Laser Desorption/lonization: A New Approach to Mass Spectrometry of Large Biomolecules,” Biological Mass Spectrometry, Burlingame and McCloskey, editors, Elsevier Science Publishers, Amsterdam) is used as a mass analyzer. Since, in most cases, no multiple molecular ion peaks are produced with this technique, the mass spectra, in principle, look simpler compared to ES mass spectrometry.

[0016] Although DNA molecules up to a molecular weight of 410,000 daltons have been desorbed and volatilized (Williams et al., “Volatilization of High Molecular Weight DNA by Pulsed Laser Ablation of Frozen Aqueous Solutions,” Science 246, 1585-87 (1989)), this technique had only shown very low resolution (oligothymidylic acids up to 18 nucleotides, Huth-Fehre et al. Rapid Commun. in Mass Spectrom., 6, 209-13 (1992); DNA fragments up to 500 nucleotides in length K. Tang etal., Rapid Commun. in Mass Spectrom., 8, 727-730 (1994); and a double-stranded DNA of 28 base pairs (Williams et al., “Time-of-Flight Mass Spectrometry of Nucleic Acids by Laser Ablation and Ionization from a Frozen Aqueous Matrix,” Rapid Commun. in Mass Spectrom., 4, 348-351 (1990)). Japanese Patent No. 59-131909 describes an instrument, which detects nucleic acid fragments separated either by electrophoresis, liquid chromatography or high speed gel filtration. Mass spectrometric detection is achieved by incorporating into the nucleic acids, atoms, such as S, Br, I or Ag, Au, Pt, Os, Hg, that normally do not occur in DNA.

[0017] Co-owned U.S. Pat. No. 5,622,824 describes methods for DNA sequencing based on mass spectrometric detection. To achieve this, the DNA is by means of protection, specificity of enzymatic activity, or immobilization, unilaterally degraded in a stepwise manner via exonuclease digestion and the nucleotides or derivatives detected by mass spectrometry. Prior to the enzymatic degradation, sets of ordered deletions that span a cloned DNA fragment can be created. In this manner, mass-modified nucleotides can be incorporated using a combination of exonuclease and DNA/RNA polymerase. This permits either multiplex mass spectrometric detection, or modulation of the activity of the exonuclease so as to synchronize the degradative process. Co-owned U.S. Pat. Nos. 5,605,798 and 5,547,835 provide methods for detecting a particular nucleic acid sequence in a biological sample. Depending on the sequence to be detected, the processes can be used, for example, in methods of diagnosis. These methods, while broadly useful and applicable to numerous embodiments, represent the first disclosure of such applications and can be improved upon.

[0018] Therefore, it is an object herein to provided improved methods for sequencing and detecting DNA molecules in biological samples. It is also an object herein to provided improved methods for diagnosis of genetic diseases, predispositions to cetain diseases, cancers, and infections.

SUMMARY OF THE INVENTION

[0019] Methods of diagnosis by detecting and/or determing sequences of nucleic acids that are based on mass spectrometry are provided herein. Methods are provided for detecting double-stranded DNA, detecting mutations and other diagnostic markers using MS analysis. In particular, methods for diagnosing neuroblastoma, detecting heredity relationships, HLA compatibility, genetic fingerprinting, detecting teleromase activity for cancer diagnosis are provided.

[0020] In certain embodiments the DNA is immobilized on a solid support either directly or via a linker and/or bead. Three permutions of the methods for DNA detection in which immobilized DNA is used are exemplified. These include: (1) immobilization of a template; hybridization of the primer; extension of the primer, or extension of the primer (single ddNTP) for sequencing or diagnostics or extension of the primer and Endonuclease degradation (sequencing); (2) immobilization of a primer; hybridization of a single stranded template; and extension of the primer, or extension of the primer (single ddNTP) for sequencing or diagnostics or extension of the primer and Endonuclease degradation (sequencing); (3) immobilization of the primer; hybridization of a double stranded template; extension of the primer, or extension of the primer (single ddNTP) for sequencing or diagnostics or extension of the primer and Endonuclease degradation (sequencing).

[0021] In certain embodiments the DNA is immobilized on the support via a selectively cleavable linker. Selectively cleavable linkers include, buta are not limited to photocleavable linkers, chemically cleavable linkers and an enzymatically (such as a restriction site (nucleic acid linker), a protease site) cleavable linkers. Inclusion of a selectively cleavable linker expands the capabilities of the MALDI-TOF MS analysis because it allows for all of the permutations of immobilization of DNA for MALDI-TOF MS, the DNA linkage to the support through the 3′- or 5′-end of a nucleic acid; allows the amplified DNA or the target primer to be extended by DNA synthesis; and further allows for the mass of the extended product (or degraded product via exonuclease degradation) to be of a size that is appropriate for MALDI-TOF MS analysis (i.e., the isolated or synthesized DNA can be large and a small primer or a large primer sequence can be used and a small restriction fragment of a gene or single strand thereof hybridized thereto).

[0022] In a preferred embodiment, the selectively cleavable linker is a chemical or photocleavable linker that is cleaved during the ionizing step of mass spectrometry. Exemplary linkers include linkers containing, a disulfide group, a leuvinyl group, an acid-labile trityl group and a hydrophobic trityl group. In other embodiments, the enzymatically cleavable linker can be a nucleic acid that is an RNA nucleotide or that encodes a restriction endonuclease site. Other enzymatically cleavable linkers include linkers that contain a pyrophosphate group, an arginine-arginine group and a lysine-lysine group. Other linkers are exemplified herein.

[0023] Methods for sequencing long fragments of DNA are provided. To perform such sequencing, specific base terminated fragments are generated from a target nucleic acid. The analysis of fragments rather than the full length nucleic acid shifts the mass of the ions to be determined into a lower mass range, which is generally more amenable to mass spectometric detection. For example, the shift to smaller masses increases mass resolution, mass accuracy and, in particular, the sensitivity for detection. Hybridization events and the actual molecular weights of the fragments as determined by mass spectrometry provide sequence information (e.g., the presence and/or identity of a mutation). In a preferred embodiment, the fragments are captured on a solid support prior to hybridization and/or mass spectrometry detection. In another preferred embodiment, the fragments generated are ordered to provide the sequence of the larger nucleic acid.

[0024] One preferred method for generating base specifically terminated fragments from a nucleic acid is effected by contacting an appropriate amount of a target nucleic acid with an appropriate amount of a specific endonuclease, thereby resulting in partial or complete digestion of the target nucleic acid. Endonucleases will typically degrade a sequence into pieces of no more than about 50-70 nucleotides, even if the reaction is not run to full completion. In a preferred embodiment, the nucleic acid is a ribonucleic acid and the endonuclease is a ribonuclease (RNase) selected from among: the G-specific RNase T₁, the A-specific RNase U₂, the A/U specific RNase PhyM, U/C specific RNase A, C specific chicken liver RNase (RNase CL3) or crisavitin. In another preferred embodiment, the endonuclease is a restriction enzyme that cleaves at least one site contained within the target nucleic acid. Another preferred method for generating base specifically terminated fragments includes performing a combined amplification and base-specific termination reaction (e.g., using an appropriate amount of a first DNA polymerase, which has a relatively low affinity towards the chain-terminating nucleotides resulting in an exponential amplification of the target; and a polymerase with a relatively high affinity for the chain terminating nucleotide resulting in base-specific termination of the polymerization. Inclusion of a tag at the 5′ and/or 3′ end of a target nucleic acid can facilitates the ordering of fragments.

[0025] Methods for determining the sequence of an unknown nucleic acid in which the 5′ and/or 3′ end of the target nucleic acid can include a tag are provided. Inclusion of a non-natural tag on the 3′ end is also useful for ruling out or compensating for the influence of 3′ heterogeneity, premature termination and nonspecific elongation. In a preferred embodiment, the tag is an affinity tag (e.g., biotin or a nucleic acid that hybridizes to a capture nucleic acid). Most preferably the affinity tag facilitates binding of the nucleic acid to a solid support. In another preferred embodiment, the tag is a mass marker (i.e. a marker of a mass that does not correspond to the mass of any of the four nucleotides). In a further embodiment, the tag is a natural tag, such as a polyA tail or the natural 3′ heterogeneity that can result, for example, from a transcription reaction.

[0026] Methods of sequence analysis in which nucleic acids have been replicated from a nucleic acid molecule obtained from a biological sample are specifically digested using one or more nucleases (deoxyribonucleases for DNA, and ribonucleases for RNA) are provided. The fragments captured on a solid support carrying the corresponding complementary sequences. Hybridization events and the actual molecular weights of the captured target sequences provide information on mutations in the gene. The array can be analyzed spot-by-spot using mass spectrometry. Further, the fragments generated can be ordered to provide the sequence of the larger target fragment.

[0027] In another embodiment, at least one primer with a 3′-terminal base is hybridized to the target nucleic acid near a site where possible mutations are to be detected. An appropriate polymerase and a set of three nucleoside triphosphates (NTPs) and the fourth added as a terminator are reacted. The extension reaction products are measured by mass spectrometry and are indicative of the presence and the nature of a mutation. The set of three NTPs and one dd-NTP (or three NTPs and one 3′-deoxy NTP), will be varied to be able to discriminate between several mutations (including compound heterozygotes) in the target nucleic acid sequence.

[0028] Methods for detecting and diagnosing neoplasia/malagnancies in a tissue or cell sample are provided. The methods rely on a telomeric repeat amplification protocol (TRAP)-MS assay and include the steps of:

[0029] a) obtaining a tissue or a cell sample, such as a clinical isolate or culture of suspected cells;

[0030] b) isolating/extracting/purifying telomerase from the sample;

[0031] c) adding the telomerase extract to a composition containing a synthetic DNA primer, which is optionally immobilized, complementary to the telomeric repeat, and all four dNTPs under conditions that result in telomerase specific extension of the synthetic DNA;

[0032] d) amplifying the telomerase extended DNA products,, preferably using a primer that contains a “linker moiety”, such as a moiety based on thiol chemistry or streptavidin;

[0033] e) isolating linker-amplified primers, such as by using a complementary binding partner immobilized on a solid support;

[0034] f) optionally conditioning the DNA for crystal formation; and

[0035] g) performing MS by ionizing/volatizing the sample to detect the DNA product.

[0036] Telomerase-specific extension is indicative of neoplaisa/malignancy. This method can be used to detect ect specific malignancies. The use of MS to detect the DNA product permits identification the extended product, which is indicative of telomerase activity in the sample. If desired, the synthetic DNA can be in the form an array.

[0037] Methods for detecting mutations are provided and the use thereof oncogenes and to thereby screen for transformed cells, which are indicative of neoplasia. Detection of mutations present in oncogenes are indicative of transformation. This method includes the steps of:

[0038] a) obtaining a biological sample;

[0039] b) amplifying a portion of the selected proto-oncogene that includes a codon indicative of transformation, where one primer has a linker moiety for immobilization;

[0040] c) immobilizing DNA via the linker moiety to a solid support, optionally in the form of an array;

[0041] d) hybridizing a primer complementary to the proto oncogene sequence that is upstream from the codon

[0042] e) adding 3dNTPs/1 ddNTP and DNA polymerase and extending the hybridized primer to the next ddNTP location;

[0043] f) ionizing/volatizing the sample; and

[0044] g) detecting the mass of the extended DNA, whereby mass indicates the presence of wild-type or mutant alleles. The presence of a mutant allele at the codon is diagnostic for neoplasia.

[0045] In an exemplary embodiment, extension-MS analysis is used detect the presence of a mutated codon 634 in the retrovirus (RET)-proto oncogene.

[0046] In another embodiment, methods for diagnosing diseases using reverse transcription and amplification of a gene expressed in transformed cells. In particular, a method for diagnosis of neuroblastoma using reverse transcriptase (RT)-MS of tyrosine hydroxylase, which is a catecholamine biosynthetic enzyme that expressed in tumor cells, but not in tumor cells but not normal cells, such as normal bone marrow cells is provided. The method includes the steps of:

[0047] a) obtaining a tissue sample;

[0048] b) isolating polyA RNA from the sample;

[0049] c) preparing a cDNA library using reverse transcription;

[0050] d) amplifing a cDNA product, or portion thereof, of the selected gene, where one oligo primer has a linker moiety;

[0051] e) isolating the amplified product by immobilizing the DNA to solid support via the linker moiety;

[0052] f) optionally conditioning the DNA;

[0053] g) ionizing/volatizing sample and detecting the presence of a DNA peak that is indicative of expression of the selected gene gene. For example, expression of the tyrosine hydroxylase gene is indicative of neuroblastoma.

[0054] Also provided are methods of directly detecting a double-stranded nucleic acid using MALDI-TOF MS. These methods include the steps of:

[0055] a) isolating a double stranded DNA of an appropriate size for MS via amplification methods or formed by hybridization of single-stranded DNA fragment;

[0056] b) preparing the double-stranded DNA for analysis under conditions that increase the ratio of dsDNA:ssDNA in which the conditions include one or all of the following: preparing samples for analysis at reduced temperatures (i.e. 4° C.), and using of higher DNA concentrations in the matrix to drive duplex formation

[0057] c) ionizing/volatizing the sample of step b), where this step uses low acceleration voltage of the ions to assist in maintaining duplex DNA by, for example, adjusting laser power to just above threshold irradiation for ionization, and

[0058] d) detecting the presence of the dsDNA of the appropriate mass.

[0059] In preferred embodiments, the matrix includes 3-hydroxypicolinic acid. The detected DNA can be indicative of a genetic disorder, genetic disease, genetic predisposition to a disease chromosomal abnormalities. In other embodiments, the mass of the double stranded DNA is indicative of the deletion, insertion, mutation.

[0060] A method designated primer oligo base extension (PROBE) is provided. This method uses a single detection primer followed by an oligonucleotide extension step to give products, which can be readily resolved by MALDI-TOF mass spectrometry. The products differ in length by a number of bases specific for a number of repeat units or for second site mutations within the repeated region. The method is exemplified using as a model system the AluVpA polymorphism in intron 5 of the interferon-α receptor gene located on human chromosome 21, and the poly T tract of the splice acceptor site of intron 8 from the CFTR gene located on human chromosome 7. The method is advantageously used for example, for determining identity, identifying mutations, familial relationship, HLA compatability and other such markers ,using PROBE-MS analysis of microsatellite DNA. In a preferred embodiment, the method includes the steps of:

[0061] a) obtaining a biological sample from two individuals;

[0062] b) amplifying a region of DNA from each individual that contains two or more microsatellite DNA repeat sequences

[0063] c) ionizing/volatizing the amplified DNA;

[0064] d) detecting the presence of the amplified DNA and comparing the molecular weight of the amplified DNA. Different sizes are indicative of non-identity (i.E. wild-type versus mutation), non-heredity or non-compatibility; similar size fragments indicate the possibility identity, of familial relationship, or HLA compatibility.

[0065] More than one marker may be examined simulataneoulsy, primers with different linker moieties are used for imimobilization.

[0066] Another method loop-primer oligo base extension, designated LOOP-PROBE, for detection of mutations especially predominant disease causing mutations or common polymorphisms is provided. In a particular embodiment, this method for detecting target nucleic acid in a sample, includes the steps of:

[0067] a) amplifying a target nucleic acid sequence, such as 8-globin, in a sample, using (i) a first primer whose 5′-end shares identity to a portion of the target DNA immediately downstream from the targeted codon followed by a sequence that introduces a unique restriction endonuclease site, such as Cfol in the case of fi-globin, into the amplicon and whose 3′-end primer is self-complementary; and (ii) a second downstream primer that contains a tag, such as biotin, for immobilizing the DNA to a solid support, such as streptavidin beads;

[0068] c) immobilizing the double-stranded amplified DNA to a solid support via the linker moiety;

[0069] d) denaturing the immobilized DNA and isolating the non-immobilized DNA strand;

[0070] e) annealing the intracomplementary sequences in the 3′-end of the isolated non-immobilzed DNA strand, such that the 3′-end is extendable by a polymerase, which annealing can be performed, for example, by heating then and cooling to about 37° C., or other suitable method;

[0071] f) extending the annealed DNA by adding DNA polymerase, 3 dNTPs/1 ddNTP, whereby the 3′-end of the DNA strand is extended by the DNA polymerase to the position of the next ddNTP location (i.e., to the mutation location);

[0072] g) cleaving the extended double stranded stem loop DNA with the unique restriction endonuclease and removing the cleaved stem loop DNA

[0073] i) (optionally adding a matrix) ioni:zing/volatizing the extended product; and

[0074] j) detecting the presence of the extended target nucleic acid, whereby the presence of a DNA fragment of a mass different from wild-type is indicative of a mutation at the target codon(s).

[0075] This method eliminates one specific reagent for mutation detection compared other methods of MS mutational analyses, thereby simplifying the process and rendering it amenable to automation. Also, the specific extended product that is analyzed is cleaved from the primer and is therefore shorter compared to the other methods. In addition, the annealing efficiency is higher compared to annealing of an added primer and should therefore generate more product. The process is compatible with multiplexing and various detection schemes (e.g., single base extension, oligo base extension and sequencing). For example, the extension of the loop-primer can be used for generation of short diagnostic sequencing ladders within highly polymorphic regions to perform, for example, HLA typing or resistance as well as species typing.

[0076] In another emodiment, a methods of detecting a target nucleic acid in a biological sample using RNA amplification is provided. In the method, the target is amplified the target nucleic acid, using a primer that shares a region complementary to the target sequence and upstream encodes a promoter, such as the T7 promoter. A DNA-dependent RNA polymerase and appropriate ribonucleotides are added to synthesize RNA, which is analyzed by MS.

[0077] Improved methods of sequencing DNA using MS are provided. In these methods thermocycling for amplification is used prior to MS analysis, thereby increasing the signal.

[0078] Also provide are primers for use in MS analyses. In particular, primers, comprising all or, for longer oligonucleotides, at least about 20, preferably about 16, bases of any of the sequence of nucleotides sequences set forth in SEQ ID NOs. 1-22, 24, 27-38, 41-86, 89, 92, 95, 98, 101-110, 112-123, 126, 128, 129, and primers set forth in SEQ ID Nos. 280-287. The primers are unlabeled, and optionally include a mass modifying moiety, which is preferably attached to the 5′end.

[0079] Other features and advantages of the methods provided herein will be further described with reference to the following Figures, Detailed Description and Claims.

BRIEF DESCRIPTION OF THE FIGURES

[0080]FIG. 1A is a diagram showing a process for performing mass spectrometric analysis on one target detection site (TDS) contained within a target nucleic acid molecule (T), which has been obtained from a biological sample. A specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). The capture sequence is chosen to specifically hybridize with a complementary sequence on the target nucleic acid molecule (T), known as the target capture site (TCS). The spacer (S) facilitates unhindered hybridization. A detector nucleic acid sequence (D), which is complementary to the TDS is then contacted with the TDS. Hybridization between D and the TDS can be detected by mass spectrometry.

[0081]FIG. 1B is a diagram showing a process for performing mass spectrometric analysis on at least one target detection site (here TDS 1 and TDS 2) via direct linkage to a solid support. The target sequence (T) containing the target detection site (TDS 1 and TDS 2) is immobilized to a solid support via the formation of a reversible or irreversible bond formed between an appropriate functionality (L′) on the target nucleic acid molecule (T) and an appropriate functionality (L) on the solid support. Detector nucleic acid sequences (here D1 and D2), which are complementary to a target detection site (TDS 1 or TDS 2) are then contacted with the TDS. Hybridization between TDS 1 and D1 and/or TDS 2 and D2 can be detected and distinguished based on molecular weight differences.

[0082]FIG. 1C is a diagram showing a process for detecting a wildtype (D^(wt)) and/or a mutant (D^(mut)) sequence in a target (T) nucleic acid molecule. As in FIG. 1A, a specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). In addition, the capture sequence is chosen to specifically interact with a complementary sequence on the target sequence (T), the target capture site (TCS) to be detected through hybridization. If the target detection site (TDS) includes a mutation, X, etection sites can be distinguished from wildtype by mass spectrometry. Preferably, the detector nucleic acid molecule (D) is designed so that the mutation is in the middle of the molecule and therefore would not lead to a stable hybrid if the wildtype detector oligonucleotide (D^(wt)) is contacted with the target detector sequence, e.g., as a control. The mutation can also be detected if the mutated detector oligonucleotide (D^(mut)) with the matching base at the mutated position is used for hybridization. If a nucleic acid molecule obtained from a biological sample is heterozygous for the particular sequence (i.e. contain D^(wt) and D^(mut)), D^(wt) and D^(mut) will be bound to the app and D^(mut) to be detected simultaneously.

[0083]FIG. 2 is a diagram showing a process in which several mutations are simultaneously detected on one target sequence molecular weight differences between the detector oligonucleotides D1, D2 and D3 must be large enough so that simultaneous detection (multiplexing) is possible. This can be achieved either by the sequence itself (composition or length) or by the introduction of mass-modiifying functionalities M1-M3 into the detector oligonucleotide.

[0084]FIG. 3 is a diagram showing still another multiplex detection format. In this embodiment, differentiation is accomplished by employing different specific capture sequences which are position-specifically immobilized on a flat surface (e.g., a ‘chip array’). If different target sequences T1-Tn are present, their target capture sites TCS1-TCSn will interact with complementary immobilized capture sequences C1-Cn. Detection is achieved by employing appropriately mass differentiated detector oligonucleotides D1-Dn, which are mass differentiated either by their sequences or by mass modifying functionalities M1-Mn.

[0085]FIG. 4 is a diagram showing a format wherein a predesigned target capture site (TCS) is incorporated into the target sequence using nucleic acid (i.e., PCR) amplification. Only one strand is captured, the other is removed (e.g., based on the interaction between biotin and streptavidin coated magnetic beads). If the biotin is attached to primer 1 the other strand can be appropriately marked by a TCS. Detection is as described above through the interaction of a specific detector oligonucleotide D with the corresponding target detection site TDS via mass spectrometry.

[0086]FIG. 5 is a diagram showing how amplification (here ligase chain reaction (LCR)) products can be prepared and detected by mass spectrometry. Mass differentiation can be achieved by the mass modifying functionalities (M1 and M2) attached to primers (P1 and P4 respectively). Detection by mass spectrometry can be accomplished directly (i.e. without employing immobilization and target capturing sites (TCS)). Multiple LCR reactions can be performed in parallel by providing an ordered array of capturing sequences (C). This format allows separation of the ligation products and spot by spot identification via mass spectrometry or multiplexing if mass differentiation is sufficient.

[0087]FIG. 6A is a diagram showing mass spectrometric analysis of a nucleic acid molecule, which has been amplified by a transcription amplification procedure. An RNA sequence is captured via its TCS sequence, so that wildtype and mutated target detection sites can be detected as above by employing appropriate detector oligonucleotides (D).

[0088]FIG. 6B is a diagram showing multiplexing to detect two different (mutated) sites on the same RNA in a simultaneous fashion using mass-modified detector oligonucleotides M1-D1 and M2-D2.

[0089]FIG. 6C is a diagram of a different multiplexing procedure for detection of specific mutations by employing mass modified dideoxynucleoside or 3′-deoxynucleoside triphosphates and an RNA dependent DNA polymerase. Alternatively, DNA dependent RNA polymerase and ribonucleotide phosphates can be employed. This format allows for simultaneous detection of all four base possibilities at the site of a mutation (X).

[0090]FIG. 7A is a diagram showing a process for performing mass spectrometric analysis on one target detection site (TDS) contained within a target nucleic acid molecule (T), which has been obtained from a biological sample. A specific capture sequence (C) is attached to a solid support (SS) via a spacer (S). The capture sequence is chosen to specifically hybridize with a complementary sequence on T known as the target capture site (TCS). A nucleic acid molecule that is complementary to a portion of the TDS is hybridized to the TDS 5′ of the site of a mutation (X) within the TDS. The addition of a complete set of dideoxynucleosides or 3′-deoxynucleoside triphosphates (e.g., pppAdd, pppTdd, pppCdd and pppGdd) and a DNA dependent DNA or RNA polymerase allows for the addition only of the one dideoxynucleoside or 3′-deoxynucleoside triphosphate that is complementary to X.

[0091]FIG. 7B is a diagram showing a process for performing mass spectrometric analysis to determine the presence of a mutation at a potential mutation site (M) within a nucleic acid molecule. This format allows for simultaneous analysis of alleles (A) and (B) of a double stranded target nucleic acid molecule, so that a diagnosis of homozygous normal, homozygous mutant or heterozygous can be provided. Allele A and B are each hybridized with complementary oligonucleotides ((C) and (D) respectively), that hybridize to A and B within a region that includes M. Each heteroduplex is then contacted with a single strand specific endonuclease, so that a mismatch at M, indicating the presence of a mutation, results in the cleavage of (C) and/or (D), which can then be detected by mass spectrometry.

[0092]FIG. 8 is a diagram showing how both strands of a target DNA can be prepared for detection using transcription vectors having two different promoters at opposite locations (e.g., the SP6 and T7 promoter). This format is particularly useful for detecting heterozygous target detections sites (TDS). Employing the SP6 or the T7 RNA polymerase both strands could be transcribed separately or simultaneously. The transcribed RNA molecules can be specifically captured and simultaneously detected using appropriately mass-differentiated detector oligonucleotides. This can be accomplished either directly in solution or by parallel processing of many target sequences on an ordered array of specifically immobilized capturing sequences.

[0093]FIG. 9 is a diagram showing how RNA prepared as described in FIGS. 6, 7 and 8 can be specifically digested using one or more ribonucleases and the fragments captured on a solid support carrying the corresponding complementary sequences. Hybridization events and the actual molecular weights of the captured target sequences provide information on whether and where mutations in the gene are present. The array can be analyzed spot by spot using mass spectrometry. DNA can be similarly digested using a cocktail of nucleases including restriction endonucleases. Mutations can be detected by different molecular weights of specific, individual fragments compared to the molecular weights of the wildtype fragments.

[0094]FIG. 10A shows UV spectra resulting from the experiment described in the following Example 1. Panel i) shows the absorbance of the 26-mer before hybridization. Panel ii) shows the filtrate of the centrifugation after hybridization. Panel iii) shows the results after the first wash with 50 mM ammonium citrate. Panel iv) shows the results after the second wash with 50 mM ammonium citrate.

[0095]FIG. 10B shows a mass spectrum resulting from the experiment described in the following Example 1 after three washing/centrifugation steps.

[0096]FIG. 10C shows a mass spectrum resulting from the experiment described in the following Example 1 showing the successful desorption of the hybridized 26-mer off of beads in accordance with the format depicted schematically in FIG. 1B.

[0097]FIG. 11 shows a mass spectrum resulting from the experiment described in the following Example 1 showing the giving proof of an experiment as schematically depicted in FIG. 1B successful desorption of the hybridized 40-mer. The efficiency of detection suggests that fragments much longer than 40-mers can also be desorbed. FIG. 12 shows a mass spectrum resulting from the experiment described in the following Example 2 showing the successful desorption and differentiation of an 18-mer and 19-mer by electrospray mass spectrometry, the mixture (top), peaks resulting from 18-mer emphasized (middle) and peaks resulting from 19-mer emphasized (bottom).

[0098]FIG. 13 is a graphic representation of the process for detecting the Cystic Fibrosis mutation ΔF508 as described in Example 3.

[0099]FIG. 14 is a mass spectrum of the DNA extension product of a ΔF508 homozygous normal of Example 3.

[0100]FIG. 15 is a mass spectrum of the DNA extension product of a ΔF508 heterozygous mutant of Example 3.

[0101]FIG. 16 is a mass spectrum of the DNA extension product of a ΔF508 homozygous normal of Example 3.

[0102]FIG. 17 is a mass spectrum of the DNA extension product of a ΔF508 homozygous mutant of Example 3.

[0103]FIG. 18 is a mass spectrum of the DNA extension product of a ΔF508 heterozygous mutant of Example 3.

[0104]FIG. 19 is a graphic representation of various processes for performing apolipoprotein E genotyping of Example 4.

[0105]FIG. 20 shows the nucleic acid sequence of normal apolipoprotein E (encoded by the E3 allele, FIG. 20B) and other isotypes encoded by the E2 and E4 alleles (FIG. 20A).

[0106]FIG. 21A shows a composite restriction pattern for various genotypes of apolipoprotein E using the Cfol restriction endonuclease.

[0107]FIG. 21B shows the restriction pattern obtained in a 3.5% MetPhor Agarose Gel for various genotypes of apolipoprotein E.

[0108]FIG. 21C shows the restriction pattern obtained in a 12% polyacrylamide gel for various genotypes of apolipoprotein E.

[0109]FIG. 22A is a chart showing the molecular weights of the 91, 83, 72, 48 and 35 base pair fragments obtained by restriction enzyme cleavage of the E2, E3 and E4 alleles of apolipoprotein E.

[0110]FIG. 22B is the mass spectrum of the restriction product of a homozygous E4 apolipoprotein E genotype.

[0111]FIG. 23A is the mass spectrum of the restriction product of a homozygous E3 apolipoprotein E genotype.

[0112]FIG. 23B is the mass spectrum of the restriction product of a E3/E4 apolipoprotein E genotype.

[0113]FIG. 24 is an autoradiograph of Example 5 of a 7.5% polyacrylamide gel in which 10% (5 μl) of each amplified sample was loaded: sample M: pBR322 Alul digested; sample 1: HBV positive in serological analysis; sample 2: also HBV positive; sample 3: without serological analysis but with an increased level of transaminases, indicating liver disease; sample 4: HBV negative containing HCV; sample 5: HBV posit-) negative control; (+) positive control). Staining was done with ethidium bromide.

[0114]FIG. 25A is a mass spectrum of sample 1, which is HBV positive. The signal at 20754 Da represents the HBV related amplification product (67 nucleotides, calculated mass: 20735 Da). The mass signal at 10390 Da represents the [M+2H]²⁺ molecule ion (calculated: 10378 Da).

[0115]FIG. 25B is a mass spectrum of sample 3, which is HBV negative corresponding to nucleic acid (i.e., PCR), serological and dot blot based assays. The amplified product is generated only in trace amounts. Nevertheless it is unambiguously detected at 20751 Da (calculated mass: 20735 Da). The mass signal at 10397 Da represents the [M+2H]²⁺ molecule ion (calculated: 10376 Da).

[0116]FIG. 25C is a mass spectrum of sample 4, which is HBV negative, but HCV positive. No HBV specific signals were observed.

[0117]FIG. 26 shows a part of the E. coli lacl gene with binding sites of the complementary oligonucelotides used in the ligase chain reaction (LCR) of Example 6. Here the wildtype sequence is displayed. The mutant contains a point mutation at bp 191 which is also the site of ligation (bold). The mutation is a C to T transition (G to A, respectively). This leads to a T-G mismatch with oligo B (and A-C mismatch with oligo C, respectively).

[0118]FIG. 27 is a 7.15% polyacrylamide gel of Example 6 stained with ethidium bromide. M: chain length standard (pUC19DNA, Mspl digested). Lane 1: LCR with wildtype template. Lane 2: LCR with mutant template. Lane 3: (control) LCR without template. The ligation product (50 bp) was only generated in the positive reaction containing wildtype template.

[0119]FIG. 28 is an HPLC chromatogram of two pooled positive LCRs.

[0120]FIG. 29 shows an HPLC chromatogram the same conditions but mutant template were used. The small signal of the ligation product is due to either template-free ligation of the educts or to a ligation at a (G-T, A-C) mismatch. The ‘false positive’ signal is significantly lower than the signal of ligation product with wildtype template depicted in FIG. 28. The analysis of ligation educts leads to ‘double-peaks’ because two of the oligonucleotides are 5′-phosphorylated.

[0121]FIG. 30 In (b) the complex signal pattern obtained by MALDI-TOF-MS analysis of Pfu DNA-ligase solution of Example 6 is depicted. In (a) a MALDI-TOF-spectrum of an unpurified LCR is shown. The mass signal 67569 Da probably represents the Pfu DNA ligase.

[0122]FIG. 31 shows a MALDI-TOF spectrum of two pooled positive LCRs (a). The signal at 7523 Da represents unligated oligo A (calculated: 7521 Da) whereas the signal at 15449 Da represents the ligation product (calculated: 15450 Da). The signal at 3774 Da is the [M+2H]²⁺ signal of oligo A. The signals in the mass range lower than 2000 Da are due to the matrix ions. The spectrum corresponds to lane 1 in FIG. 27 and the chromatogram in FIG. 28. In (b) a spectrum of two pooled negative LCRs (mutant template) is shown. The signal at 7517 Da represents oligo A (calculated: 7521 Da).

[0123]FIG. 32 shows a spectrum of two pooled control reactions (with salmon sperm DNA as template). The signals in the mass range around 2000 Da are due to Tween20, only oligo A could be detected, as expected.

[0124]FIG. 33 shows a spectrum of two pooled positive LCRs (a). The purification was done with a combination of ultrafiltration and streptavidin DynaBeads as described in the text. The signal at 1 5448 Da represents the ligation product (calculated: 15450 Da). The signal at 7527 represents oligo A (calculated: 7521 Da). The signals at 3761 Da is the [M+2H]²⁺ signal of oligo A, whereas the signal at 5140 Da is the [M+3H]²⁺ signal of the ligation product. In (b) a spectrum of two pooled negative LCRs (without template) is shown. The signal at 7514 Da represents oligo A (calculated: 7521 Da).

[0125]FIG. 34 is a schematic presentation of the oligo base extension of the mutation detection primer as described in Example 7, using ddTTP (A) or ddCTP (B) in the reaction mix, respectively. The theoretical mass calculation is given in parenthesis. The sequence shown is part of the exon 10 of the CFTR gene that bears the most common cystic fibrosis mutation ΔF508 and more rare mutations ΔI507 as well as lIe506Ser.

[0126]FIG. 35 is a MALDI-TOF-MS spectrum recorded directly from precipitated oligo base extended primers for mutation detection. The spectrum in (A) and (B), respectively show the annealed primer (CF508) without further extension reaction. Panel C displays the MALDI-TOF spectrum of the wild type by using pppTdd in the extension reaction and D a heterozygotic extension products carrying the 506S mutation when using pppCdd as terminator. Panels E and F show a heterozygote with ΔF508 mutation with pppTdd and pppCdd as terminators in the extension reaction. Panels G and H represent a homozygous ΔF508 mutation with either pppTdd or pppCdd as terminators. The template of diagnosis is pointed out below each spectrum and the observed/expected molecular mass are written in parenthesis.

[0127]FIG. 36 shows the portion of the sequence of pRFcl DNA, which was used as template for nucleic acid amplification in Example 8 of unmodified and 7-deazapurine containing 99-mer and 200-mer nucleic acids as well as the sequences of the 19-mer forward primer and the two 18-mer reverse primers.

[0128]FIG. 37 shows the portion of the nucleotide sequence of M13mp18 RFI DNA, which was used in Example 8 for nucleic acid amplification of unmodified and 7-deazapurine containing 103-mer nucleic acids. Also shown are nucleotide sequences of the 17-mer primers used in the nucleic acid amplification reaction.

[0129]FIG. 38 shows the result of a polyacrylamide gel electrophoresis of amplified products described in Example 8 purified and concentrated for MALDI-TOF MS analysis. M: chain length marker, lane 1: 7-deazapurine containing 99-mer amplified product, lane 2: unmodified 99-mer, lane 3: 7-deazapurine containing 103-mer and lane 4: unmodified 103-mer amplified product.

[0130]FIG. 39: an autoradiogram of polyacrylamide gel electrophoresis of nucleic acid (i.e., PCR) reactions carried out with 5′-[³²P]-labeled primers 1 and 4. Lanes 1 and 2: unmodified and 7-deazapurine modified 103-mer amplified product (53321 and 23520 counts), lanes 3 and 4: unmodified and 7-deazapurine modified 200-mer (71123 and 39582 counts) and lanes 5 and 6: unmodified and 7-deazapurine modified 99-mer (173216 and 94400 counts).

[0131]FIG. 40 a) MALDI-TOF mass spectrum of the unmodified 103-mer amplified products (sum of twelve single shot spectra). The mean value of the masses calculated for the two single strands (31768 u and 31759 u) is 31763 u. Mass resolution: 18. b) MALDI-TOF mass spectrum of 7-deazapurine containing 103-mer amplified product (sum of three single shot spectra). The mean value of the masses calculated for the two single strands (31727 u and 31719 u) is 31723 u. Mass resolution: 67.

[0132]FIG. 41: a) MALDI-TOF mass spectrum of the unmodified 99-mer amplified product (sum of twenty single shot spectra). Values of the masses calculated for the two single strands: 30261 u and 30794 u. b) MALDI-TOF mass spectrum of 7-deazapurine containing 99-mer amplified product (sum of twelve single shot spectra). Values of the masses calculated for the two single strands: 30224 u and 30750 u.

[0133]FIG. 42: a) MALDI-TOF mass spectrum of the unmodified 200-mer amplified product (sum of 30 single shot spectra). The mean value of the masses calculated for the two single strands (61873 u and 61595 u) is 61734 u. Mass resolution: 28. b) MALDI-TOF mass spectrum of 7-deazapurine containing 200-mer amplified product (sum of 30 single shot spectra). The mean value of the masses calculated for the two single strands (61772 u and 61714 u) is 61643 u. Mass resolution: 39.

[0134]FIG. 43: a) MALDI-TOF mass spectrum of 7-deazapurine containing 100-mer amplified product with ribomodified primers. The mean value of the masses calculated for the two single strands (30529 u and 31095 u) is 30812 u. b) MALDI-TOF mass spectrum of the amplified product after hydrolytic primer-cleavage. The mean value of the masses calculated for the two single strands (25104 u and 25229 u) is 25167 u. The mean value of the cleaved primers (5437 u and 5918 u) is 5677 u.

[0135]FIG. 44 A-D shows the MALDI-TOF mass spectrum of the four sequencing ladders obtained from a 39-mer template (SEQ ID No. 23), which was immobilized to streptavidin beads via a 3′ biotinylation. A 14-mer primer (SEQ ID NO. 24) was used in the sequencing according to Example 9.

[0136]FIG. 45 shows a MALDI-TOF mass spectrum of a solid phase sequencing of a 78-mer template (SEQ ID No. 25), which was immobilized to streptavidin beads via a 3′ biotinylation. A 18-mer primer (SEQ ID No. 26) and ddGTP were used in the sequencing.

[0137]FIG. 46 shows a scheme in which duplex DNA probes with single-stranded overhang capture specific DNA templates and also serve as primers for solid phase sequencing.

[0138]FIG. 47 A-D shows MALDI-TOF mass spectra obtained from a sequencing reaction using 5′ fluorescent labeled 23-mer (SEQ ID No. 29) annealed to a 3′ biotinylated 18-mer (SEQ ID No. 30), leaving a 5-base overhang, which captured a 15-mer template (SEQ ID No. 31) as described in Example 9.

[0139]FIG. 48 shows a stacking fluorogram of the same products obtained from the reaction described in FIG. 47, but run on a conventional DNA sequencer.

[0140]FIG. 49 shows a MALDI-TOF mass spectrum of the sequencing ladder using cycle sequencing as described in Example 1 generated from a biological amplified product as template and a 12mer (5′-TGC ACC TGA CTC-3′ (SEQ ID NO. 34)) sequencing primer. The peaks resulting from depurinations and peaks which are not related to the sequence are marked by an asterisk. MALDI-TOF MS measurements were taken on a reflectron TOF MS. A.) Sequencing ladder stopped with ddATP; B.) Sequencing ladder stopped with ddCTP; C.) Sequencing ladder stopped with ddGTP; D.) Sequencing ladder stopped with ddTTP.

[0141]FIG. 50 shows a schematic representation of the sequencing ladder generated in FIG. 49 with the corresponding calculated molecular masses up to 40 bases after the primer. For the calculation, the following masses were used: 3581.4Da for the primer, 312.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for dCTP and :328.2 Da for 7-deaza-dGTP.

[0142]FIG. 51 shows the sequence of the amplified 209 bp amplified product within the,β-globin gene, which was used as a template for sequencing. The sequences of the appropriate amplification primer and the location of the 12mer sequencing primer is also shown. This sequence represents a homozygote mutant at the position 4 bases after the primer. In a wildtype sequence this T would be replaced by an A.

[0143]FIG. 52 shows a sequence which is part of the intron 5 of the interferon-receptor gene that bears the AluVpA polymorphism as further described in Example 11. The scheme presents the primer oligo base extension (PROBE) using ddGTP, ddCTP, or both for termination, respectively. The polymorphism detection primer (IFN) is underlined, the termination nucleotides are marked in bold letters. The theoretical mass values from the alleles found in 28 unrelated individuals and a five member family are given in the table. Both second site mutations found in most 13 units allele, but not all, are indicated.

[0144]FIG. 53 shows the MALDI-TOF-MS spectra recorded directly form precipitated extended cyclePROBE reaction products. Family study using AluVpA polymorphism in intron 5 of the interferon-α receptor gene (Example 11).

[0145]FIG. 54 shows the mass spectra from PROBE products using ddC as termination nucleotide in the reaction mix. The allele with the molecular mass of approximately 11650 da from the DNA of the mother and child 2 is a hint to a second site mutation within one of the repeat units.

[0146]FIG. 55 shows a schematic presentation of the PROBE method for detection of different alleles in the polyT tract at the 3′-end of intron 8 of the CFTR gene with pppCdd as terminator (Example 11).

[0147]FIG. 56 shows the MALDI-TOF-MS spectra recorded directly from the precipitated extended PROBE reaction products. Detection of all three common alleles of the polyT tract at the 3′ end of Intron 8 of the CFTR gene. (a) T5/T9 heterozygous, (b) T7/T9 heterozygous (Example 11).

[0148]FIG. 57 shows a mass spectrum of the digestion of a 252-mer ApoE gene amplified product (ε3/ε3 genotype) as described in Example 12 using a) Cfol alone and b) Cfol plus Rsal. Asterisks: depurination peaks.

[0149]FIG. 58 shows a mass spectrum of the ApoE gene amplified product (ε3/ε3 genotype) digested by Cfol and purified by a) single and b) double ethanol/glycogen and c) double isopropyl alcohol/glycogen precipitations.

[0150]FIG. 59 shows a mass spectrum of the Cfol/Rsal digest products from a) ε2/ε3. b) ε3/ε3, c) ε3/ε4, and d) ε4/ε4 genotypes. Dashed lines are drawn through diagnostic fragments.

[0151]FIG. 60 shows a scheme for rapid identification of unknown ApoE genotypes following simultaneous digestion of a 252-mer apo E gene amplified product by the restriction enzymes Cfol and Rsal.

[0152]FIG. 61 shows the multiplex (codons 112 and 158) mass spectrum PROBE results for a) ε2/ε3, b) ε3/ε3, c) ε3/ε4, and d) ε4/ε4 genotypes. E: extension products; P: unextended primer. Top: codon 112 and 158 regions, with polymorphic sites; bold and primer sequences underlined.

[0153]FIG. 62 shows a mass spectrum of a TRAP assay to detect telomerase activity (Example 13). The spectrum shows two of the primer signals of the amplified product TS primer at 5,497.3 Da (calc. 5523 Da) and the biotinylated bioCX primer at 7,537.6 Da (calc. 7,537 Da) and the first telomerase-specific assay product containing three telomeric repeats at 12,775.8 Da (calc. 12,452 Da) its mass is larger by one dA nucleotide (12,765 Da) due to extendase activity of Taq DNA polymerase.

[0154]FIG. 63 depicts the higher mass range of FIG. 62, i.e. the peak at 12,775.6 Da represents the products with these telomeric repeats. The peaks at 20,322.1 Da is the result of a telomerase activity to form seven telomeric repeats (calc. 20,395 Da including the extension by one dA nucleotide). The peaks marked 1, 2, 3 and 4 contain a four telomeric repeats at 14,674 Da as well as secondary ion product.

[0155]FIG. 64 displays a MALDI-TOF spectrum of the RT-amplified product of the human tyrosine hydroxylase rnRNA indicating the presence of neuroblastoma cells (Example 14). The signal at 18,763.8 Da represents the non-biotinylated single-stranded 61 mer of the nested amplified product (calc. 18,758.2 Da).

[0156]FIG. 65 (a) shows a schematic representation of a PROBE reaction for the RET proto-oncogene with a mixture of dATP, dCTP, dGTP, and ddTTP (Example 15). B represents biotin, through which the sense template strand is bound through streptavidin to a solid support.

[0157]FIG. 65(b) shows the expected PROBE products for ddT and ddA reactions for wildtype, C→T, and C→A antisense strands.

[0158]FIG. 66 shows the PROBE product mass spectra for (a) negative control, (b) Patient 1 being heterozygote (Wt/C→T) and (c) Patient 2 being heterozygote (Wt/C→A), reporting average IVIr values.

[0159]FIG. 67 shows the MALDI-FTMS spectra for synthetic analogs representing ribo-cleaved RET proto-oncogene amplified products from (a) wildtype, (b) G→A, and (c) G→T homozygotes, and (d) wildtype/G→A, (e) wildtype/G→T, and (f) G→A/G→T heterozygotes, reporting masses of most abundant isotope peaks.

[0160]FIG. 68 is a schematic representation of nucleic acid immobilization via covalent bifunctional trityl linkers.

[0161]FIG. 69 is a schematic representation of nucleic acid immobilization via hydrophobic trityl linkers.

[0162]FIG. 70 shows a MALDI-TOF mass spectrum of a supernatant of the matrix treated Dynabeads containing bound oligo (5′-iminobiotin—TGCACCTGACTC, SEQ ID NO. 56). An internal standard (CTGTGGTCGTGC, SEQ ID NO. 57) was included in the matrix.

[0163]FIG. 71 shows a MALDI-TOF mass spectrum of a supernatant of the matrix treated Dynabeads containing bound oligo (5′-iminobiotin—TGCACCTGACTC, SEQ ID NO. 56). An internal standard (CTGTGGTCGTGC, SEQ ID NO. 57) was included in the matrix.

[0164]FIG. 72 schematically depicts the steps involved with the Loop-primer oligo base extension (Loop-probe) reaction.

[0165]FIG. 73A shows a MALDI-TOF mass spectrum of a supernatant after Cfol digest of a stem loop. FIG. 73B-D show MALDI-TOF mass spectrum of different genotypes: HbA the wildtype genotype (74B), HbC, a mutation of codon 6 of the β-globin gene which causes sickle cell disease (74C), and HbS, a different mutation of codon 6 of the β-globin gene which causes sickle cell disease (74D).

[0166]FIG. 74 shows the nucleic acid sequence of the amplified region of CKR-5. The underlined sequence corresponds to the region homologous to the amplification primers. The dotted region corresponds to the 32 bp deletion.

[0167]FIG. 75 shows the sense primer ckrT7f. Being designed to facilitate binding of T7-RNA polymerase and amplification of the CKR-5 region to be analyzed, it starts with a randomly chosen sequence of 24 bases, the T7 promoter sequence of 18 bases and the sequence homologous to CKR-5 of 19 bases.

[0168]FIG. 76 is a MALDI-TOF mass spectrum of the CKR-5 amplification product, which was generated as described in the following Example 21.

[0169]FIG. 77 is a positive ion UV-MALDII mass spectra of a synthetic RNA 25-mer (5′-UCCGGUCUGAUGAGUCCGUGAGGAC-3′ SEQ ID NO. 62) digested with selected RNAses. For each enzyme 0.6 μl aliquots of teh 4.5 μl assay containing a total of ca. 20 pmol of the RNA were fixed with 1.5 μl matrix (3-HPA) for analysis. Fragments with retained 5′-terminus are marked by different arrows, specific for the different RNAses, (Hahner et al., Proceedings of the 44^(th) ASMS Conference on Mass Spectrometry and Allied Topics, p. 983 (1996)).

[0170]FIG. 78 is an investigation of the specificity of the RNAses CL₃ and Cusativin by positive ion UV-MALDI mass spectra of a synthetic RNA 20-mer. Expected and/or observed cleavage! sites are indicated by arrows. A, B, C indicate correct cleavage sites and corresponding singly cleaved fragments. Missing cleavages are designated by a question mark (?), unspecific cleavages by an X.

[0171]FIG. 79 shows the separation of a mixture of DNA molecules (12-mer, 5′-biot. 19-mer, 22-mer and 5′-biot. 27-mer) with streptavidin-coated magnetic beads. a) positive ion UV-MALDI mass spectrum of 0.6 μl of a mixture containing ca. 2-4 pmol of each species mixed with 1.5 μl matrix (3-HPA). b) same as a) but incubation of the mixture with magnetic beads and subsequent release of the captured fragments.

[0172]FIG. 80 Elution of immobilized 5′ biotinylated 49 nt in vitro transcript from the streptavidin-coated magnetic beads. Positive UV-MALDI mass spectrum of the transcript prior to incubation with the magnetic beads (a). Spectra of the immobilized RNA transcript after elution with 95% formamide alone (b) and with various additives such as 10 mM EDTA (c), 10 mM CDTA (d) and 25% ammonium hydroxide (e); EDTA and CDTA were adjusted with 25% ammonium hydroxide to a pH of 8.

[0173]FIG. 81 Positive UV-MALDI mass spectra of the 5′ biotinylated 49 nt in vitro transcript after RNAse U₂ digest for 1 5 minutes. a) Spectrum of the 25 ul assay containing ca. 100 pmol of the target RNA before separation; b) spectrum after isolation of the 5′-biotinylated fragments with magnetic beads. Captured fragments were released by a solution of 95% formamide containing 10 mM CDTA. 1 ul aliquots of the samples were mixed with 1.5 ul matrix (3-HPA) in both cases.

[0174]FIG. 82 schematically depicts detection of putative mutations in the human β-globin gene at codon 5 and 6 and at codon 30, and the IVS-1 donor site, respectively, done in parallel. FIG. 82A shows amplification of genomic DNA using the primers β2 and β11. The location of the primers and identification tags as well as an indication of the wild type and mutant sequences are shown. FIG. 82B shows analysis of both sites in a simple Primer Reaction Oligo Base Extension (PROBE) using primers β-TAG1 (which binds upstream of codon 5 and 6) and β-TAG2 (which binds upstream of codon 30 and the IVS-1 donor site). Reaction products are captured using streptavidin-coated paramagnetic particle bound biotinylated capture primers (cap-tag-1 and cap-tag-2, respectively), that have 6 bases at the 5′ end that are complementary to the 5′ end of β-TAG1 and β-TAG2, respectively, and a portion which binds to a universal primer.

[0175]FIG. 83 shows a mass spectrum of the PROBE products of a DNA sample from one individual analyzed as described schematically in FIG. 82.

[0176]FIG. 84 shows a mass spectrum of the sequence bound to cap-tag-2.

[0177]FIG. 85 shows a mass spectrum obtained by using the β-TAG1 and fl-TAG 2 primers in one sequencing reaction using ddATP for termination and then sorting according to the method depicted in FIG. 82.

[0178]FIG. 86 shows a mass spectrum obtained by using the β-TAG1 and β-TAG2 primers in one sequencing reaction using ddCTP for termination and then sorting according to the method depicted in FIG. 82.

[0179]FIG. 87A shows the wildtype sequence of a fragment of the chemokine receptor CKR-5 gene with primers (bold) used for amplification. The 32 base pair (bp) deletion in the CKR-5 allele is underlined; and the stop nucleotides are in italic. In FIG. 87B, the wildtype strands are depicted with and without an added Adenosine, their length and molecular masses are indicated. FIG. 87C indicates the same for the 32 bp deletion. FIG. 87D shows the PROBE products for the wildtype gene and FIG. 87E shows the mutated allele.

[0180]FIG. 88 shows the amplification products of different unrelated individuals as analyzed by native polyacrylannide gel electrophoreses (15%) and silver stain. The band corresponding to a wildtype CKR-5 runs at 75 bp and the band from the gene with the deletion at 43 bp. Bands bigger than 75 bp are due to unspecific amplification.

[0181]FIG. 89A shows a spectrograph of DNA derived from a heterozygous individual: the peak with a mass of 23319 Da corresponds to the wildtype CKR-5 and the peaks with masses of 13137 Da and 13451 Da to the deletion allele with and without an extra Adenosine, respectively. FIG. 89B shows a spectrograph of DNA obtained from the same individual as in FIG. 89A, but the DNA was treated with T4 DNA polymerase to remove the added Adenosine. FIGS. 89C and 89D are spectrographs derived from homozygous individuals and in FIG. 89D, the Adenosine has been removed. All peaks with masses lower than 13000 Da are due to multiple charged molecules.

[0182]FIG. 90A shows the mass spectrum of the results of a PROBE reaction performed on DNA obtained from a heterozygous individual. FIG. 90B shows a mass spectrum of the results of a PROBE reaction on a homozygous individual. The peaks with masses of 6604 Da and 6607 Da, respectively correspond to the wildtype allele, and the peak with a mass of 6275 Da to the deletion allele. The primer is detected with a mass of 5673 and 5676 Da, respectively.

[0183]FIG. 91 shows a MALDI-TOF MS spectra of a thermocycling primer Oligo Base Extension (tc-PROBE) reaction as described in Example 24 using three different templates and 5 different PROBE primers simultaneously in one reaction.

[0184]FIG. 92 schematically depicts a single tube process for amplifying and sequencing exons 5-8 of the p53 gene as described in Example 25. The mass spectrum is the A reaction of FIG. 93.

[0185]FIG. 93 shows a superposition plot of four separate reactions for sequencing a portion of exon 7 of the p53 gene as described in Example 25.

[0186]FIG. 94 shows the mass spectrum obtained from the A reaction for sequencing a portion of exon 7 of the p53 gene as described in Example 25.

[0187]FIG. 95 shows the mass spectrum of a p53 sequencing ladder for which 5nL of each reaction were transferred to wells of a chip and measured by MALDI-TOF.

[0188]FIG. 96A shows a MALDI-TOF mass spectra of a synthetic 50-mer (15.34 kDa) mixed with 27-mer_(nc) (non-complementary, 8.30 kDa).

[0189]FIG. 96B shows a MALDI-TOF mass spectra of a synthetic 50-mer (15.34k Da) mixed with a 27-merc (complementary, 8.34 kDa). The final concentration of each oligonucleotide was 10 μM. The signal at 23.68 kDa in FIG. 96B corresponds to WC-specific dsDNA.

[0190]FIG. 97A shows a MALDI-TOF mass spectrum of Cfol/Rsal digest products of a region of exon 4 of the apolipoprotein E gene (ε3 genotype), using sample preparation as in FIG. 96.

[0191]FIG. 97B is the same as FIG. 97A, except with samples prepared for MALDI-TOF analysis at 4° C.

[0192]FIG. 98 shows a MALDI-TOF mass spectrum of Cfol/Rsal simultaneously double digest products of a 252 base pair region of exon 4 of the apolipoprotein E gene (e4 genotype), with samples prepared at 4° C.

[0193]FIG. 99 shows the mass spectra obatined on a small population study of 1 5 patients with a 1 6 element array of diagnostic products transferred to a MALDI target using a pintool microdispenser.

[0194]FIG. 100 is a MALDI mass spectrum of an aliquot sampled after a T₁ digest of a synthetic 20-mer RNA.

DETAILED DESCRIPTION OF THE INVENTION

[0195] Definitions

[0196] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of skill in the art to which this invention belongs. Where permitted the subject matter of each of the co-pending patent applications and the patent is herein incorporated in its entirety.

[0197] As used herein, the term “biological sample” refers to any material obtained from any living source (e.g., human, animal, plant, bacteria, fungi, protist, virus). For purposes herein, the biological sample will typically contain a nucleic acid molecule. Examples of appropriate biological samples include, but are not limited to: solid materials (e.g., tissue, cell pellets, biopsies) and biological fluids (e.g., urine, blood, saliva, amniotic fluid, mouth wash, cerebral spinal fluid and other body fluids).

[0198] As used herein, the phrases “chain-elongating nucleotides” and “chain-terminating nucleotides” are used in accordance with their art recognized meaning. For example, for DNA, chain-elongating nucleotides include 2′deoxyribonucleotides (e.g., dATP, dCTP, dGTP and dTTP) and chain-terminating nucleotides include 2′,3′-dideoxyribonucleotides (e.g., ddATP, ddCTP, ddGTP, ddTTP). For RNA, chain-elongating nucleotides include ribonucleotides (e.g., ATJP, CTP, GTP and UTP) and chain-terminating nucleotides include 3′-deoxyriboinucleotides (e.g., 3′dA, 3′dC, 3′dG and 3′dU). A complete set of chain elongating nucleotides refers to dATP, dCTP, dGTP and dTTP. The term “nucleotide” is also well known in the art.

[0199] As used herein, nucleotides include nucleoside mono-, di-, and triphosphates. Nucleotides also include modified nucleotides such as phosphorothioate nucleotides and deazapurine nucleotides. A complete set of chain-elongating nucleotides refers to four different nucleotides that can hybridize to each of the four different bases comprising the DNA template.

[0200] As used herein, the superscript O-i designates i+1 mass differentiated nucleotides, primers or tags. In some instances, the superscript O can designate an unmodified species of a particular reactant, and the superscript i can designate the i-th mass-modified species of that reactant. If, for example, more than one species of nucleic acids are to be concurrently detected, then i+1 different mass-modified detector oligonucleotides (D^(O), D¹, . . . D′) can be used to distinguish each species of mass modified detector oligonucleotides (D) from the others by mass spectrometry.

[0201] As used herein, “multiplexing” refers to the simultaneously detection of more than one analyte, such as more than one (mutated) loci on a particular captured nucleic acid fragment (on one spot of an array).

[0202] As used herein, the term “nucleic acid” refers to single-stranded and/or double-stranded polynucleotides such as deoxyribonucleic acid (DNA), and ribonucleic acid (RNA) as well as analogs or derivatives of either RNA or DNA. Also included in the term “nucleic acid” are analogs of nucleic acids such as peptide nucleic acid (PNA), phosphorothioate DNA, and other such analogs and derivatives.

[0203] As used herein, the term “conjugated” refers stable attachment, preferably ionic or covalent attachment. Among preferred conjugation means are: streptavidin- or avidin- to biotin interaction; hydrophobic interaction; magnetic interaction (e.g., using functionalized magnetic beads, such as DYNABEADS, which are streptavidin-coated magnetic beads sold by Dynal, Inc. Great Neck, N.Y. and Oslo Norway); polar interactions, such as “wetting” associations between two polar surfaces or between oligo/polyethylene glycol; formation of a covalent bond, such as an amide bond, disulfide bond, thioether bond, or via crosslinking agents; and via an acid-labile or photocleavable linker.

[0204] As used herein equivalent, when referring to two sequences of nucleic acids means that the two sequences in question encode the same sequence of amino acids or equivalent proteins. When “equivalent” is used in referring to two proteins or peptides, it means that the two proteins or peptides have substantially the same amino acid sequence with only conservative amino acid substitutions that do not substantially alter the activity or function of the protein or peptide. When “equivalent” refers to a property, the property does not need to be present to the same extent [e.g., two peptides can exhibit different rates of the same type of enzymatic activity], but the activities are preferably substantially the same. “Complementary,” when referring to two nucleotide sequences, means that the two sequences of nucleotides are capable of hybridizing, preferably with less than 25%, more preferably with less than 15%, even more preferably with less than 5%, most preferably with no mismatches between opposed nucleotides. Preferably the two molecules will hybridize under conditions of high stringency.

[0205] As used herein: stringency of hybridization in determining percentage mismatch are those conditions understood by those of skill in the art and typically are substantially equivalent to the following:

[0206] 1) high stringency: 0.1×SSPE, 0.1% SDS, 65° C.

[0207] 2) medium stringency: 0.2×SSPE, 0.1% SDS, 50° C.

[0208] 3) low stringency: 1.0×SSPE, 0.1% SDS, 50° C.

[0209] It is understood that equivalent stringencies may be achieved using alternative buffers, salts and temperatures.

[0210] As used herein, a primer when set forth in the claims refers to a primer suitable for mass spectrometric methods requiring immobilizing, hybridizing, strand displacement, sequencing mass spectrometry refers to a nucleic acid must be of low enough mass, typically about 70 nucleotides or less than 70, and of sufficient size to be useful in the mass spectrometric methods described herein that rely on mass spectrometric detection. These methods include primers for detection and seequening of nucleic acids, which require a sufficient number nucleotides to from a stable duplex, typically about 6-30, preferably about 10-25, more preferably about 12-20. Thus, for purposes herein a primer will be a sequence of nucleotides comprising about 6-70, more preferably a 12-70, more preferably greater than about 14 to an upper limit of 70, depending upon sequence and application of the primer. The primers herein, for example for mutational analyses, are selected to be upstream of loci useful for diagnosis such that when performing using sequencing up to or through the site of interest, the resulting fragment is of a mass that sufficient and not too large to be detected by mass spectrometry. For mass spectrometric methods, mass tags or modifier are preferably included at the 5′-end, and the primer is otherwise unlabeled.

[0211] As used herein, “conditioning” of a nucleic acid refers to modification of the phosphodiester backbone of the nucleic acid molecule (e.g, cation exchange) for the purpose of eliminating peak broadening due to a heterogeneity in the cations bound per nucleotide unit. Contacting a nucleic acid molecule with an alkylating agent such as akyliodide, iodoacetamide, β-iodoethanol, or 2,3-epoxy-1-propanol, the monothio phosphodiester bonds of a nucleic acid molecule can be transformed into a phosphotriester bond. Likewise, phosphodiester bonds may be transformed to uncharged derivatives employing trialkylsilyl chlorides. Further conditioning involves incorporating nucleotides that reduce sensitivity for depurination (fragmentation during MS) e.g., a purine analog such as N7- or N9-deazapurine nucleotides, or RNA building blocks or using oligonucleotide triesters or incorporating phosphorothioate functions that are alkylated or employing oligonucleotide mimetics such as peptide nucleic acid (PNA).

[0212] As used herein, substrate refers to an insoluble support onto which a sample is deposited according to the materials described herein. Examples of appropriate substrates include beads (e.g., silica gel, controlled pore glass, magnetic, agaroase gele and crosslinked dextroses (i.e. Sepharose and Sephadex, cellulose and other materials known by those of skill in the art to serve as solid support matrices. For examples substrates may be formed from any or combitions of: silica gel, glass, magnet, polystyrene/1% divinylbenzene resins, such as Wang resins, which are Fmoc-amino acid-4-(hydroxymethyl)phenoxymethyl-copoly(styrene-1% divinylbenzene (DVD)) resin, chlorotrityl (2-chlorotritylchloride copolystyrene-DVB resin) resin, Merrifield (chloromethylated copolystyrene-DVB) resin metal, plastic, cellulose, cross-linked dextrans, such as those sold under the tradename Sephadex (Pharmacia) and agarose gel, such as gels sold under the tradename Sepharose (Pharmacia), which is a hydrogen bonded polysaccharide-type agarose gel, and other such resins and solid phase supports known to those of skill in the art. The support matrices may be in any shape or form, including, but not limited to: capillaries, flat supports such as glass fiber filters, glass surfaces, metal surfaces (steel, gold, silver, aluminum, copper and silicon), plastic materials including multiwell plates or membranes (e.g., of polyethylene, polypropylene, polyamide, polyvinylidenedifluoride), pins (e.g., arrays of pins suitable for combinatorial synthesis or analysis or beads in pits of flat surfaces such as wafers (e.g., silicon wafers) with or without plates, and beads.

[0213] As used herein, a selectively cleavable linker is a linker that is cleaved under selected conditions, such as a photocleavable linker, a chemically cleavable linker and an enzymatically cleavable linker (i.e., a restriction endonuclease site or a ribonucleotide/RNase digestion). The linker is interposed between the support and immobilized DNA.

[0214] Isolation of nucleic acids molecules

[0215] Nucleic acid molecules can be isolated from a particular biological sample using any of a number of procedures, which are well-known in the art, the particular isolation procedure chosen being appropriate for the particular biological sample. For example, freeze-thaw and alkaline lysis procedures can be useful for obtaining nucleic acid molecules from solid materials; heat and alkaline lysis procedures can be useful for obtaining nucleic acid molecules from urine; and proteinase K extraction can be used to obtain nucleic acid from blood (see, e.g., Rolff et al. (1994) PCR: Clinical Diagnostics and Research, Springer).

[0216] To obtain an appropriate quantity of a nucleic acid molecules on which to perform mass spectrometry, amplification may be necessary. Examples of appropriate amplification procedures for use herein include: cloning (Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 1989), polymerase chain reaction (PCR) (C. R. Newton and A. Graham, PCR, BIOS Publishers, 1994), ligase chain reaction (LCR) (see, e.g., Weidmann et al. (1994) PCR Methods Appl. Vol. 3, Pp. 57-64; F. Barany (1991) Proc. Natl. Acad. Sci. U.S.A. 88:189-93), strand displacement amplification (SDA) (see, e.g., Walker et al. (1994) Nucleic Acids Res. 22:2670-77) and variations such as RT-PCR (see, e.g., Higuchi et al. (1993) Bio/Technology 11:1026-1030), allele-specific amplification (ASA) and transcription based processes.

[0217] Immobilization of nucleic acid molecules to solid supports

[0218] To facilitate mass spectrometric analysis, a nucleic acid molecule containing a nucleic acid sequence to be detected can be immobilized to an insoluble (i.e., a solid) support. Examples of appropriate solid supports include beads (e.g., silica gel, controlled pore glass, magnetic, agarose and polyacrylamide, (SEPHADEX/SEPHAREOS, cellulose), capillaries, flat supports such as glass fiber filters, glass surfaces, metal surfaces (steel, gold, silver, aluminum, copper and silicon), plastic materials including multiwell plates or membranes (e.g., of polyethylene, polypropylene, polyamide, polyvinylidenedifluoride), pins (e.g., arrays of pins suitable for combinatorial synthesis or analysis or beads in pits of flat surfaces such as wafers (e.g., silicon wafers) with or without filter plates.

[0219] Samples containing target nucleic acids can be transferred to solid supports by any of a variety of methods known to those of skill in the art. For example, nucleic acid samples can be transferred to individual wells of a substrate, e.g., silicon chip, manually or using a pintool microdispenser apparatus as described herein. Alternatively, a piezoelectric pipette apparatus can be used to transfer small nanoliter samples to a substrate permitting the performance of high throughput miniaturized diagnostics on a chip.

[0220] Immobilization can be accomplished, for example, based on hybridization between a capture nucleic acid sequence, which has already been immobilized to the support and a complementary nucleic acid sequence, which is also contained within the nucleic acid molecule containing the nucleic acid sequence to be detected (FIG. 1A). So that hybridization between the complementary nucleic acid molecules is not hindered by the support, the capture nucleic acid can include an e.g., spacer region of at least about five nucleotides in length between the solid support and the capture nucleic acid sequence. The duplex formed will be cleaved under the influence of the laser pulse and desorption can be initiated. The solid support-bound nucleic acid molecule can be presented through natural oligoribo- or oligodeoxyribonucleotide as well as analogs (e.g., thio-modified phosphodiester or phosphotriester backbone) or employing oligonucleotide mimetics such as PNA analogs (see, e.g., Nielsen etal., Science 254:1497 (1991)) which render the base sequence less susceptible to enzymatic degradation and bound capture base sequence.

[0221] Linkers

[0222] A target detection site can be directly linked to a solid support via a reversible or irreversible bond between an appropriate functionality (L′) on the target nucleic acid molecule (T) and an appropriate functionality (L) on the capture molecule (FIG. 1B). A reversible linkage can be such that it is cleaved under the conditions of mass spectrometry (i.e., a photocleavable bond such as a charge transfer complex or a labile bond being formed between relatively stable organic radicals).

[0223] Photocleavable linkers are linkers that are cleaved upon exposure to light (see, e.g., Goldmacher et al. (1992) Bioconj. Chem. 3:104-107), thereby releasing the targeted agent upon exposure to light. Photocleavable linkers that are cleaved upon exposure to light are known (see, e.g., Hazum et al. (1981) in Pept., Proc. Eur. Pept. Symp., 16th, Brunfeldt, K (Ed), pp. 105-110, which describes the use of a nitrobenzyl group as a photocleavable protective group for cysteine; Yen et al. (1989) Makromol. Chem 1 90:69-82, which describes water soluble photocleavable copolymers, including hydroxypropylmethacrylamide copolymer, glycine copolymer, fluorescein copolymer and methylrhodamine copolymer; Goldmacher et al. (1992) Bioconj. Chem. 3:104-107, which describes a cross-linker and reagent that undergoes photolytic degradation upon exposure to near UV light (350 nm); and Senter et al. (1 985) Photochem. Photobiol 42:231-237, which describes nitrobenzyloxycarbonyl chloride cross linking reagents that produce photocleavable linkages), thereby releasing the targeted agent upon exposure to light. In preferred embodiments, the nucleic acid is immobilized using the photocleavable linker moiety that is cleaved during mass spectrometry. Presently preferred photocleavable linkers are set forth in the EXAMPLES.

[0224] Furthermore, the linkage can be formed with L′ being a quaternary ammonium group, in which case, preferably, the surface of the solid support carries negative charges which repel the negatively charged nucleic acid backbone and thus facilitate the desorption required for analysis by a mass spectrometer. Desorption can occur either by the heat created by the laser pulse and/or, depending on L,′ by specific absorption of laser energy which is in resonance with the L′ chromophore.

[0225] Thus, the L-L′ chemistry can be of a type of disulfide bond (chemically cleavable, for example, by mercaptoethanol or dithioerythrol), a biotin/streptavidin system, a heterobifunctional derivative of a trityl ether group (see, e.g., Köster et al. (1 990) “A Versatile Acid-Labile Linker for Modification of Synthetic Biomolecules,” Tetrahedron Letters 31:7095) that can be cleaved under mildly acidic conditions as well as under conditions of mass spectrometry, a levulinyl group cleavable under almost neutral conditions with a hydrazinium/acetate buffer, an arginine-arginine or lysine-lysine bond cleavable by an endopeptidase enzyme like trypsin or a pyrophosphate bond cleavable by a pyrophosphatase, or a ribonucleotide bond in between the oligodeoxynucleotide sequence, which can be cleaved, for example, by a ribonuclease or alkali.

[0226] The functionalities, L and L,′ can also form a charge transfer complex and thereby form the temporary L-L′ linkage. Since in many cases the “charge-transfer band” can be determined by UV/vis spectrometry (see, e., Organic Charge Transfer Complexes by R. Foster, Academic Press, 1969), the laser energy can be tuned to the corresponding energy of the charge-transfer wavelength and, thus, a specific desorption off the solid support can be initiated. Those skilled in the art will recognize that several combinations can serve this purpose and that the donor functionality can be either on the solid support or coupled to the nucleic acid molecule to be detected or vice versa.

[0227] In yet another approach, a reversible L-L′ linkage can be generated by homolytically forming relatively stable radicals. Under the influence of the laser pulse, desorption (as discussed above) as well as ionization will take place at the radical position. Those skilled in the art will recognize that other organic radicals can be selected and that, in relation to the dissociation energies needed to homolytically cleave the bond between them, a corresponding laser wavelength can be selected (see e.g., Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984).

[0228] An anchoring function L′ can also be incorporated into a target capturing sequence (TCS) by using appropriate primers during an amplification procedure, such as PCR (FIG. 4), LCR (FIG. 5) or transcription amplification (FIG. 6A).

[0229] When performing exonuclease sequencing using MALDI-TOF MS, a single stranded DNA molecule immobilized via its 5-end to a solid support is unilaterally degraded with a 3′-processive exonuclease and the molecular weight of the degraded nucleotide is determined sequentially. Reverse Sanger sequencing reveals the nucleotide sequence of the immobilized DNA. By adding a selectively cleavable linker, not only can the mass of the free nucleotides be determined but also, upon removal of the nucleotides by washing, the mass of the remaining fragment can be detected by MALDI-TOF upon cleaving the DNA from the solid support. Using selectively cleavable linkers, such as the photocleavable and chemical cleavable linkers provided herein, this cleavage can be selected to occur during the ionization and volatizing steps of MALDI-TOF. The same rationale applies for a 5′ immobilized strand of a double stranded DNA that is degraded while in a duplex. Likewise, this also applies when using a 5′-processive exonuclease and the DNA is immobilized through the 3′-end to the solid support.

[0230] As noted, at least three version of immobilization are contemplated herein: 1) the target nucleic acid is amplified or obtained (the target sequence or surrounding DNA sequence must be known to make primers to amplify or isolated); 2) the primer nucleic acid is immobilized to the solid support and the target nucleic acid is hybridized thereto (this is for detecting the presence of or sequencing a target sequence in a sample); or 3) a double stranded DNA (amplified or isolated) is immobilized through linkage to one predetermined strand, the DNA is denatured to eliminate the duplex and then a high concentration of a complementary primer or DNA with identity upstream from the target site is added and a strand displacement occurs and the primer is hybridized to the immobilized strand.

[0231] In the embodiments where the primer nucleic acid is immobilized on the solid support and the target nucleic acid is hybridized thereto, the inclusion of the cleavable linker allows the primer DNA to be immobilized at the 5′-end so that free 3′-OH is available for nucleic acid synthesis (extension) and the sequence of the “hybridized” target DNA can be determined because the hybridized template can be removed by denaturation and the extended DNA products cleaved from the solid support for MALDI-TOF MS. Similarly for 3), the immobilized DNA strand can be elongated when hybridized to the template and cleaved from the support. Thus, Sanger sequencing and primer oligo base extension (PROBE), discussed below, extension reactions can be performed using an immobilized primer of a known, upstreamn DNA sequence complementary to an invariable region of a target sequence. The nucleic acid from the person is obtained and the DNA sequence of a variable region (deletion, insertion, missense mutation that cause genetic predisposition or diseases, or the presence of viral/bacterial or fungal DNA) not only is detected, but the actual sequence and position of the mutation is also determined.

[0232] In other cases, the target DNA must be immobilized and the primer annealed. This requires amplifying a larger DNA based on known sequence and then sequencing the immobilized fragments (i.e., the extended fragments are hybridized but not immobilized to the support as described above). In these cases, it is not desirable to include a linker because the MALDI-TOF spectrum is of the hybridized DNA; it is not necessary to cleave the immobilized template.

[0233] Any linker known to those of skill in the art for immobilizing nucleic acids to solid supports may be used herein to link the nucleic acid to a solid support. The preferred linkers herein are the selectively cleavable linkers, particularly those exemplified herein. Other linkers include, acid cleavable linkers, such as bismaleimideothoxy propane, acid-labile trityl linkers.

[0234] Acid cleavable linkers, photocleavable and heat sensitive linkers may also be used, particularly where it may be necessary to cleave the targeted agent to permit it to be more readily accessible to reaction. Acid cleavable linkers include, but are not limited to, bismaleimideothoxy propane; and adipic acid dihydrazide linkers (see, e.g., Fattom et al. (1992) Infection & Immun. 60:584-589) and acid labile transferrin conjugates that contain a sufficient portion of transferrin to permit entry into the intracellular transferrin cycling pathway (see, e.g., Welhöner et al. (1991) J. Biol. Chem. 266:4309-4314).

[0235] Photocleavable Linkers

[0236] Photocleavable linkers are provided. In particular, photocleavable linkers as their phosphoramidite derivatives are provided for use in solid phase synthesis of oligonucleotides. The linkers contain o-nitrobenzyl moieties and phosphate linkages which allow for complete photolytic cleavage of the conjugates within minutes upon UV irradiation. The UV wavelengths used are selected so that the irradiation will not damage the oligonucleotides and are preferrably about 350-380 nm, more preferably 365 nm. The photocleavable linkers provided herein possess comparable coupling efficiency as compared to commonly used phosphoramidite monomers (see, Sinha et al. (1983) Tetrahedron Lett. 24:5843-5846; Sinha et al. (1984) Nucleic Acids Res. 12:4539-4557; Beaucage et al. (1993) Tetrahedron 49:6123-6194; and Matteucci et al. (1981) J. Am. Chem. Soc. 103:3185-3191).

[0237] In one embodiment, the photocleavable linkers have formula I:

[0238] where R²⁰ is ω-(4,4′-dimethoxytrityloxy)alkyl or ω-hydroxyalkyl; R²¹ is selected from hydrogen, alkyl, aryl, alkoxycarbonyl, aryloxycarbonyl and carboxy; R²² is hydrogen or (dialkylamino)(ω-cyanoalkoxy)P-; t is 0-3; and R⁵⁰ is alkyl, alkoxy, aryl or aryloxy.

[0239] In a preferred embodiment, the photocleavable linkers have formula II:

[0240] where R²⁰ is ω-(4,4′-dimethoxytrityloxy)alkyl, ω-hydroxyalkyl or alkyl; R²¹ is selected from hydrogen, alkyl, aryl, alkoxycarbonyl, aryloxycarbonyl and carboxy; R²² is hydrogen or (dialkylamino)(ω-cyanoalkoxy)P-; and X²⁰ is hydrogen, alkyl or OR²⁰.

[0241] In particularly preferred embodiments, R²⁰ is 3-(4,4′-dimethoxytrityloxy)propyl, 3-hydroxypropyl or methyl; R²¹ is selected from hydrogen, methyl and carboxy; R²² is hydrogen or (diisopropylamino)(2-cyanoethoxy)P-; and X²⁰ is hydrogen, methyl or OR²⁰. In a more preferred embodiment, R²⁰ is 3-(4,4′-dimethoxytrityloxy)propyl; R²¹ is methyl; R²² is (diisopropylamino)(2-cyanoethoxy)P-; and X²⁰ is hydrogen. In another more preferred embodiment, R²⁰ is methyl; R²¹ is methyl; R²² is (diisopropylamino)(2-cyanoethoxy)P-; and X²⁰ is 3-(4,4′-dimethoxytrityloxy)propoxy.

[0242] In another embodiment, the photocleavable linkers have formula III:

[0243] where R ²³ is hydrogen or (dialkylamino)(ω-cyanoalkoxy)P-; and R ²⁴ is selected from ω-hydroxyalkoxy, ω-(4,4′-dimethoxytrityloxy)alkoxy, ω-hydroxyalkyl and ω-(4,4′-dimethoxytrityloxy)alkyl, and is unsubstituted or substituted on the alkyl or alkoxy chain with one or more alkyl groups; r and s are each independently 0-4; and R⁵⁰ is alkyl, alkoxy, aryl or aryloxy. In certain embodiments, R ²⁴ is ω-hydroxyalkyl or ω-(4,4′-dimethoxytrityloxy)alkyl, and is substituted on the alkyl chain with a methyl group.

[0244] In preferred embodiments, R²³ is hydrogen or (diisopropylamino)(2-cyanoethoxy)P-; and R²⁴ is selected from 3-hydroxypropoxy, 3-(4,4′-dimethoxytrityloxy)propoxy, 4-hydroxybutyl, 3-hydroxy-1-propyl, 1-hydroxy-2-propyl, 3-hydroxy-2-methyl-1-propyl, 2-hydroxyethyl, hydroxymethyl, 4-(4,4′-dimethoxytrityloxy)butyl, 3-(4,4′-dimethoxytrityloxy)-1-propyl, 2-(4,4′-dimethoxytrityloxy)ethyl, 1-(4,4′-dimethoxytrityloxy)-2-propyl, 3-(4,4′-dimethoxytriyloxy)-2-methyl-1-propyl and 4,4′-dimethyoxytrityloxymethyl.

[0245] In more preferred embodiments, R²³ is (diisopropylamino)(2-cyanoethoxy)P-; r and s are 0; and R²⁴ is selected from 3-(4,4′-dimethoxytrityloxy)propoxy, 4-(4,4′-dimethoxytrityloxy)butyl, 3-(4,4′-dimethoxytrityloxy)propyl, 2-(4,4′-dimethoxytrityloxy)ethyl, 1-(4,4′-dimethoxytrityloxy)-2-propyl, 3-(4,4′-dimethoxytriyloxy)-2-methyl-1-propyl and 4,4′-dimethyoxytrityloxymethyl. R²⁴ is most preferably 3-(4,4′-dimethoxytrityloxy)propoxy.

[0246] Preparation of the photocleavable linkers

[0247] A. Preparation of photocleavable linkers of formulae I or II

[0248] Photocleavable linkers of formulae I or II may be prepared by the methods described below, by minor modification of the methods by choosing the appropriate starting materials or by any other methods known to those of skill in the art. Detailed procedures for the synthesis of photocleavable linkers of formula II are provided in the Examples.

[0249] In the photocleavable linkers of formula II where X²⁰ is hydrogen, the linkers may be prepared in the following manner. Alkylation of 5-hydroxy-2-nitrobenzaldehyde with an ω-hydroxyalkyl halide, e.g., 3-hydroxypropyl bromide, followed by protection of the resulting alcohol as, e.g., a silyl ether, provides a 5-(ω-silyloxyalkoxy)-2-nitrobenzaldehyde. Addition of an organometallic to the aldehyde affords a benzylic alcohol. Organometallics which may be used include trialkylaluminums (for linkers where R²¹ is alkyl), such as trimethylaluminum, borohydrides (for linkers where R²¹ is hydrogen), such as sodium borohydride, or metal cyanides (for linkers where R²¹ is carboxy or alkoxycarbonyl), such as potassium cyanide. In the case of the metal cyanides, the product of the reaction, a cyanohydrin, would then be hydrolyzed under either acidic or basic conditions in the presence of either water or an alcohol to afford the compounds of interest.

[0250] The silyl group of the side chain of the resulting benzylic alcohols may then be exchanged for a 4,4′-dimethoxytriyl group by desilylation with, e.g., tetrabutylammonium fluoride, to give the corresponding alcohol, followed by reaction with 4,4′-dimethoxytrityl chloride. Reaction with, e.g., 2-cyanoethyl diisopropylchlorophosphoramidite affords the linkers where R²² is (dialkylamino)(ω-cyanoalkoxy)P-.

[0251] A specific example of a synthesis of a photocleavable linker of formula 11 is shown in the following scheme, which also demonstrates use of the linker in oligonucleotide synthesis. This scheme is intended to be illustrative only and in no way limits the scope of the invention. Experimental details of these synthetic transformations are provided in the Examples.

[0252] Synthesis of the linkers of formula 11 where X²⁰ is OR²⁰, 3,4-dihydroxyacetophenone is protected selectively at the 4-hydroxyl by reaction with, e.g., potassium carbonate and a silyl chloride. Benzoate esteres, propiophenones, butyrophenones, etc. may be used in place of the acetophenone. The resulting 4-silyloxy-3-hydroxyacetophenone is then alkylated at the with an alkyl halide (for linkers where R²⁰ is alkyl) at the 3-hydroxyl and desilylated with, e.g., tetrabuylammonium fluoride to afford a 3-alkoxy-4-hydroxyacetophenone. This compound is then alkylated at the 4-hydroxyl by reaction with an ω-hydroxyalkyl halide, e.g., 3-hydroxypropyl bromide, to give a 4-(ω-hydroxyalkoxy)-3-alkoxyacetophenone. The side chain alcohol is then protected as an ester, e.g., an acetate. This compound is then nitrated at the 5-position with, e.g., concentrated nitric acid to provide the corresponding 2-nitroacetophenones. Saponification of the side chain ester with, e.g., potassium carbonate, and reduction of the ketone with, e.g., sodium borohydride, in either order gives a 2-nitro-4-(ω-hydroxyalkoxy)-5-alkoxybenzylic alcohol.

[0253] Selective protection of the side chain alcohol as the corresponding 4,4′-dimethoxytrityl ether is then accomplished by reaction with 4,4′-dimethoxytrityl chloride. Further reaction with, e.g., 2-cyanoethyl diisopropylchlorophosphoramidite affords the linkers where R²² is (dialkylamino)(ω-cyanoalkoxy)P-.

[0254] A specific example of the synthesis of a photocleavable linker of formula II is shown the following scheme. This scheme is intended to be illustrative only and in no way limit the scope of the invention. Detailed experimental procedures for the transformations shown are found in the Examples.

[0255] B. Preparation of photocleavable linkers of formula III

[0256] Photocleavable linkers of formula IlIl may be prepared by the methods described below, by minor modification of the methods by choosing appropriate starting materials, or by other methods known to those of skill in the art.

[0257] In general, photocleavable linkers of formula IlI are prepared from ω-hydroxyalkyl- or alkoxyaryl compounds, in particular ω-hydroxy-alkyl or alkoxy-benzenes. These compounds are commercially available, or may be prepared from an ω-hydroxyalkyl halide (e.g., 3-hydroxypropyl bromide) and either phenyllithium (for the ω-hydroxyalkylbenzenes) or phenol (for the ω-hydroxyalkoxybenzenes). Acylation of the ω-hydroxyl group (e.g., as an acetate ester) followed by Friedel-Crafts acylation of the aromatic ring with 2-nitrobenzoyl chloride provides a 4-(ω-acetoxy-alkyl or alkoxy)-2-nitrobenzophenone. Reduction of the ketone with, e.g., sodium borohydride, and saponification of the side chain ester are performed in either order to afford a 2-nitrophenyl-4-(hydroxy-alkyl or alkoxy)phenylmethanol. Protection of the terminal hydroxyl group as the corresponding 4,4′-dimethoxytrityl ether is achieved by reaction with 4,4′-dimethoxytrityl chloride. The benzylic hydroxyl group is then reacted with, e.g., 2-cyanoethyl diisopropylchlorophosphoramidite to afford linkers of formula II where R²³ is (dialkylamino)(ω-cyanoalkoxy)P-.

[0258] Other photocleavable linkers of formula IlIl may be prepared by substituting 2-phenyl-1-propanol or 2-phenylmethyl-1-propanol for the ω-hydroxy-alkyl or alkoxy-benzenes in the above synthesis. These compounds are commercially available, but may also be prepared by reaction of, e.g., phenylmagnesium bromide or benzylmagnesium bromide, with the requisite oxirane (i.e., propylene oxide) in the presence of catalytic cuprous ion.

[0259] Chemically cleavable linkers

[0260] A variety of chemically cleavable linkers may be used to introduce a cleavable bond between the immobilized nucleic acid and the solid support. Acid-labile linkers are presently preferred chemically cleavable linkers for mass spectrometry, especially MALDI-TOF MS, because the acid labile bond is cleaved during conditioning of the nucleic acid upon addition of the 3-HPA matrix solution. The acid labile bond can be introduced as a separate linker group, e.g., the acid labile trityl groups (see FIG. 68; Example 1 6) or may be incorporated in a synthetic nucleic acid linker by introducing one or more silyl internucleoside bridges using diisopropylsilyl, thereby forming diisopropylsilyl-linked oligonucleotide analogs. The diisopropylsilyl bridge replaces the phoshodiester bond in the DNA backbone and under mildly acidic conditions, such as 1.5% trifluoroacetic acid (TFA) or 3-HPA/1% TFA MALDI-TOF matrix solution, results in the introduction of one or more intra-strand breaks in the DNA molecule. Methods for the preparation of diisopropylsilyl-linked oligonucleotide precursors and analogs are known to those of skill in the art (see e.g., Saha et al. (1993) J. Org. Chem. 58:7827-7831). These oligonucleotide analogs may be readily prepared using solid state oligonucleotide synthesis methods using diisopropylsilyl derivatized deoxyribonucleosides.

[0261] Nucleic acid conditioning

[0262] Prior to mass spectrometric analysis, it may be useful to “condition” nucleic acid molecules, for example to decrease the laser energy required for volatilization and/or to minimize fragmentation. Conditioning is preferably performed while a target detection site is immobilized. An example of conditioning is modification of the phosphodiester backbone of the nucleic acid molecule (e.g., cation exchange), which can be useful for eliminating peak broadening due to a heterogeneity in the cations bound per nucleotide unit. Contacting a nucleic acid molecule with an alkylating agent such as akyliodide, iodoacetamide, β-iodoethanol, or 2,3-epoxy-1-propanol, the monothio phosphodiester bonds of a nucleic acid molecule can be transformed into a phosphotriester bond. Likewise, phosphodiester bonds may be transformed to uncharged derivatives employing trialkylsilyl chlorides. Further conditioning involves incorporating nucleotides that reduce sensitivity for depurination (fragmentation during MS) e.g., a purine analog such as N7- or N9-deazapurine nucleotides, or RNA building blocks or using oligonucleotide triesters or incorporating phosphorothioate functions which are alkylated or employing oligonucleotide mimetics such as PNA.

[0263] Multiplex reactions

[0264] For certain applications, it may be useful to simultaneously detect more than one (mutated) loci on a particular captured nucleic acid fragment (on one spot of an array) or it may be useful to perform parallel processing by using oligonucleotide or oligonucleotide mimetic arrays on various solid supports. “Multiplexing” can be achieved by several different methodologies. For example, several mutations can be simultaneously detected on one target sequence by employing corresponding detector (probe) molecules (e.g., oligonucleotides or oligonucleotide mimetics). The molecular weight differences between the detector oligonucleotides D1, D2 and D3 must be large enough so that simultaneous detection (multiplexing) is possible. This can be achieved either by the sequence itself (composition or length) or by the introduction of mass-modifying functionalities M1-M3 into the detector oligonucleotide (seel FIG. 2).

[0265] Mass modification of nucleic acids

[0266] Mass modifying moieties can be attached, for instance, to either the 5′-end of the oligonucleotide (M¹), to the nucleobase (or bases) (M², M⁷), to the phosphate backbone (M³), and to the 2′-position of the nucleoside (nucleosides) (M⁴, M⁶) and/or to the terminal 3′-position (M⁵). Examples of mass modifying moieties include, for example, a halogen, an azido, or of the type, XR, wherein X is a linking group and R is a mass-modifying functionality. The mass-modifying functionality can thus be used to introduce defined mass increments into the oligonucleotide molecule.

[0267] The mass-modifying functionality can be located at different positions within the nucleotide moiety (see, e.g., U.S. Pat. No. 5,547,835 and International PCT application No. WO 94/21822). For example, the mass-modifying moiety, M, can be attached either to the nucleobase, M² (in case of the c⁷ -deazanucleosides also to C-7, M⁷), to the triphosphate group at the alpha phosphate, M³, or to the 2′-position of the sugar ring of the nucleoside triphosphate, M⁴ and M⁶. Modifications introduced at the phosphodiester bond (M4), such as with alpha-thio nucleoside triphosphates, have the advantage that these modifications do not interfere with accurate Watson-Crick base-pairing and additionally allow for the one-step post-synthetic site-specific modification of the complete nucleic acid molecule e.g., via alkylation reactions (see, e.g., Nakamaye et al. (1988) Nucl. Acids Res. 16:9947-59). Particularly preferred mass-modifying functionalities are boron-modified nucleic acids since they are better incorporated into nucleic acids by polymerases (see, e.g., Porter et al. (1995) Biochemistry 34:11 963-11969; Hasan et al. (1 996) Nucleic Acids Res. 24:2150-2157; Li et al. (1995) Nucl. Acids Res. 23:4495-4501).

[0268] Furthermore, the mass-modifying functionality can be added so as to affect chain termination, such as by attaching it to the 3′-position of the sugar ring in the nucleoside triphosphate, M⁵. For those skilled in the art, it is clear that many combinations can be used in the methods provided herein. In the same way, those skilled in the art will recognize that chain-elongating nucleoside triphosphates can also be mass-modified in a similar fashion with numerous variations and combinations in functionality and attachment positions.

[0269] Without being bound to any particular theory, the mass-modification, M, can be introduced for X in XR as well as using oligo-/polyethylene glycol derivatives for R. The mass-modifying increment in this case is 44, i.e. five different mass-modified species can be generated by just changing m from 0 to 4 thus adding mass units of 45 (m=0), 89 (m=1), 133 (m=2), 177 (m =3) and 221 (m =4) to the nucleic acid molecule (e.g., detector oligonucleotide (D) or the nucleoside triphosphates (FIG. 6(C)), respectively). The oligo/polyethylene glycols can also be monoalkylated by a lower alkyl such as methyl, ethyl, propyl, isopropyl, t-butyl and the like. A selection of linking functionalities, X, are also illustrated. Other chemistries can be used in the mass-modified compounds (see, e.g., those described in Oligonucleotides and Analogues, A Practical Approach, F. Eckstein, editor, IRL Press, Oxford, 1991).

[0270] In yet another embodiment, various mass-modifying functionalities, R, other than oligo/polyethylene glycols, can be selected and attached via appropriate linking chemistries, X. A simple mass-modification can be achieved by substituting H for halogens like F, Cl, Br and/or I, or pseudohalogens such as CN, SCN, NCS, or by using different alkyl, aryl or aralkyl moieties such as methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, benzyl, or functional groups such as CH₂F, CHF₂, CF₃, Si(CH₃)₃, Si(CH₃)₂(C₂H₅), Si(CH₃)(C₂H₅)₂, Si(C₂H₅)₃. Yet another mass-modification can be obtained by attaching homo- or heteropeptides through the nucleic acid molecule (e.g., detector (D)) or nucleoside triphosphates. One example. useful in generating mass-modified species with a mass increment of 57. is the attachment of oligoglycines, e.g., mass-modifications of 74 (r=1, m=0), 131 (r=1, m =1), 188 (r=1, m=2), 245 (r=1, m =3) are achieved. Simple oligoamides also can be used, e.g., mass-modifications of 74 (r=1, m=0), 88 (r=2, m=0), 102 (r=3, m=0), 116(r=4, m=0), etc. are obtainable. Variations in additions to those set forth herein will be apparent to the skilled artisan.

[0271] Different mass-modified detector oligonucleotides can be used to simultaneously detect all possible variants/mutants simultaneously (FIG. 6B). Alternatively, all four base permutations at the site of a mutation can be detected by designing and positioning a detector oligonucleotide, so that it serves as a primer for a DNA/RNA polymerase with varying combinations of elongating and terminating nucleoside triphosphates (FIG. 6C). For example, mass modifications also can be incorporated during the amplification process.

[0272]FIG. 3 shows a different multiplex detection format, in which differentiation is accomplished by employing different specific capture sequences which are position-specifically immobilized on a flat surface (e.g., a ‘chip array’). If different target sequences T1-Tn are present, their target capture sites TCS1-TCSn will specifically interact with complementary immobilized capture sequences C1-Cn. Detection is achieved by employing appropriately mass differentiated detector oligonucleotides D1-Dn, which are mass modifying functionalities M1-Mn.

[0273] Mass spectrometric methods for sequencing DNA

[0274] Amenable mass spectrometric formats for use herein include the ionization (I) techniques, such as matrix assisted laser desorption ionization (MALDI), electrospray (ESI) (eq., continuous or pulsed); and related methods (e.g., lonspray, Thermospray, Fast Atomic Bombardment), and massive cluster impact (MCI); these ion sources can be matched with detection formats including lin-linear fields) time-of-flight (TOF), single or multiple quadrupole, single or multiple magnetic sector, Fourier transform ion cyclotron resonance (FTICR), ion trap, or combinations of these to give a hybrid detector (e.g., ion trap—time of flight). For ionization, numerous matrix/wavelength combinations including frozen analyte preparation (MALDI) or solvent combinations (ESI) can be employed.

[0275] Since a normal DNA molecule includes four nucleotide units (A, T, C, G), and the mass of each of these is unique (monoisotopic masses 313.06, 304.05, 289.05, 329.05 Da, respectively), an accurate mass determination can define or constrain the possible base compositions of that DNA. Only above 4900 Da does each unit molecular weight have at least one allowable composition; among all 5-mers there is only one non-unique nominal molecular weight, among 8-mers, 20. For these and larger oligonucleotides, such mass overlaps can be resolved with the ˜{fraction (1/10)}⁵ (˜10 part per million, ppm) mass accuracy available with high resolution FTICR MS. For the 25-mer A5T₂₀, the 20 composition degeneracies when measured at ±0.5 Da is reduced to three (A₅T₂₀, T₄C₁₂G₉, AT₃C₄G₁₆) when measured with 2 ppm accuracy. Given composition constraints (e.g., the presence or absence of one of the four bases in the strand) can reduce this further (see below).

[0276] Medium resolution instrumentation, including but not exclusively curved field reflectron or delayed extraction time-of-flight MS instruments, can also result in improved DNA detection for sequencing or diagnostics. Either of these are capable of detecting a 9 Da (Δm (A-T)) shift in ≧30-mer strands generated from, for example primer oligo base extension (PROBE), or competitive oligonucleotide single base extension (COSBE), sequencing, or direct detection of small amplified products.

[0277] BiomassScan

[0278] In this embodiment, exemplified in Example 33, two single stranded nucleic acids are individually immobilized to solid supports. One support contains a nucleic acid encoding the wild type sequence whereas the other support contains a nucleic acid encoding a mutant target sequence. Total human genomic DNA is digested with one or more restriction endonuclease enzyme resulting in the production of small fragments of double stranded genomic DNA (10-1,000 bp). The digested DNA is incubated with the immobilized single stranded nucleic acids and the sample is heated to denature the DNA duplex. The immobilized nucleic acid competes with the other genomic DNA strand for the complementary DNA strand and under the appropriate conditions, a portion of the complementary DNA strand hybridizes to the immobilized nucleic acid resulting in a strand displacement. By using high stringency washing conditions, the two nucleic acids will remain as a DNA duplex only if there is exact identity between the immobilized nucleic acid and the genomic DNA strand. The DNA that remains hybridized to the immobilized nucleic acid is analyzed by mass spectrometry and detection of a signal in the mass spectrum of the appropriate mass is diagnostic for the wild type or mutant allele. In this manner, total genomic DNA can be isolated from a biological sample and screened for the presence or absence of certain mutations. By immobilizing a variety of single stranded nucleic acids in an array format, a panel of mutations may be simultaneously screened for a number of genetic loci (i.e., multiplexing).

[0279] In addition, using less stringent washing conditions the hybridized DNA strand may be analyzed by mass spectrometry for changes in the mass resulting from a deletion or insertion within the targeted restriction endonuclease fragment.

[0280] Primer oligonucleotide base extension

[0281] As described in detail in the following Example 11, the primer oligo base extension (PROBE) method combined with mass spectrometry identifies the exact number of repeat units (i.e. the number of nucleotides in homogenous stretches) as well as second site mutations within a polymorphic region, which are otherwise only detectable by sequencing. Thus, the PROBE technique increases the total number of detectable alleles at a distinct genomic site, leading to a higher polymorphism information content (PIC) and yielding a far more definitive identification in for instance statistics-based analyses in paternity or forensics applications.

[0282] The method is based on the extension of a detection primer that anneals adjacent to a variable nucleotide tandem repeat (VNTR) or a polymorphic mononucleotide stretch using a DNA polymerase in the presence of a mixture of deoxyNTPs and those dideoxyNTPs that are not present in the deoxy form. The resulting products are evaluated and resolved by MALDI-TOF mass spectrometry without further labeling of the DNA. In a simulated routine application with 28 unrelated individuals, the mass error of this procedure using external calibration was in the worst case 0.38% (56-mer), which is comparable to approximately 0.1 base accuracy; routine standard mass deviations are in the range of 0.1% (.03 bases). Such accuracy with conventional electrophoretic methods is not realistic, underscoring the value of PROBE and mass spectrometry in forensic medicine and paternity testing.

[0283] The ultra-high resolution of Fourier Transform mass spectrometry makes possible the simultaneous measurement of all reactions of a Sanger or Maxam Gilbert sequencing experiment, since the sequence may be read from mass differences instead of base counting from 4 tubes.

[0284] Additionally, the mass differences between adjacent bases generated from unilateral degradation in a stepwise manner by an exonuclease can be used to read the entire sequence of fragments generated. Whereas UV or fluorescent measurements will not discriminate mixtures of the nucleoside/nucleotide which are generated when the exonuclease enzyme gets out of phase, this is no problem with mass spectrometry since the resolving power in differentiating between the molecular mass of dA, dT, dG and dC is more than significant. The mass of the adjacent bases (i.e., nucleotides) can be determined, for example, using Fast Atomic Bombardment (FAB) or Electronspray Ionization (ESI) mass spectrometry.

[0285] New mutation screening over an entire amplified product can be achieved by searching for mass shifted fragments generated in an endonuclease digestion as described in detail in the following Examples 4 and 12.

[0286] Partial sequence information obtained from tandem mass spectrometry (MS^(n)) can place composition constraints as described in the preceding paragraph. For the 25-mer above, generation of two fragment ions formed by collisionally activated dissociation (CAD) which differ by 313 Da discounts T₄C₁₂G₉, which contains no A nucleotides; confirming more than a single A eliminates AT₃C₄G₁₆ as a possible composition.

[0287] MS^(n) can also be used to determined full or partial sequences of larger DNAs; this can be used to detect, locate, and identify new mutations in a given gene region. Enzymatic digest products whose masses are correct need not be further analyzed; those with mass shifts could be isolated in real time from the complex mixture in the mass spectrometer and partially sequenced to locate the new mutation.

[0288] Table I describes the mutation/polymorphism detection tests that have been developed. TABLE I Mutation/Polymorphism Detection Tests Clinical Association Gene Mutation/Polymorphism Cystic Fibrosis CFTR 38 disease causing mutations in 14 exons/introns Heart Disease (Cholesterol Apo E 112R, 112C, 158R, 158C Metabolism) Apo A-IV 347S, 347T, 360H, 360Q Apo B-100 3500Q, 3500R Thyroid Cancer RET proto- C634W, C634T, C634R, oncogene C634S, C634F Sickle Cell Anemia/ beta-globin Sickle cell anemia S and C Thalassemia 45 thalassemia alleles HIV Susceptibility CKR-5 32 bp deletion Breast Cancer BRCA-2 2 bp (AG) deletion in exon 2 Susceptibility Thrombosis Factor V R506Q Arteriosclerosis GpIIIa L33P E-selectin 5128R Hypertension ACE I/D polymorphism

[0289] Detection of mutations

[0290] Diagnosis of genetic diseases

[0291] The mass spectrometric processes described above can be used, for example, to diagnose any of the more than 3000 genetic diseases currently known (e.g., hemophilias, thalassemias, Duchenne Muscular Dystrophy (DMD), Huntington's Disease (HD), Alzheimer's Disease and Cystic Fibrosis (CF)) or to be identified.

[0292] The following Example 3 provides a mass spectrometric method for detecting a mutation (ΔF508) of the cystic fibrosis transmembrane conductance regulator gene (CFTR), which differs by only three base pairs (900 daltons) from the wild type of CFTR gene. As described further in Example 3, the detection is based on a single-tube, competitive oligonucleotide single base extension (COSBE) reaction using a pair of primers with the 3′-terminal base complementary to either the normal or mutant allele. Upon hybridization and addition of a polymerase and the nucleoside triphosphate one base downstream, only those primers properly annealed (i..e, no 3′-terminal mismatch) are extended; products are resolved by molecular weight shifts as determined by matrix assisted laser desorption ionization time-of-flight mass spectrometry. For the cystic fibrosis ΔF508 polymorphism, 28-mer ‘normal’ (N) and 30-mer ‘mutant’ (M) primers generate 29- and 31-mers for N and M homozygotes, respectively, and both for heterozygotes. Since primer and product molecular weights are relatively low (<10 kDa) and the mass difference between these are at least that of a single ˜300 Da nucleotide unit, low resolution instrumentation is suitable for such measurements.

[0293] Thermosequence cycle sequencing, as further described in Example 11, is also useful for detecting a genetic disease.

[0294] In addition to mutated genes, which result in genetic disease, certain birth defects are the result of chromosomal abnormalities such as Trisomy 21 (Down's Syndrome), Trisomy 13 (Patau Syndrome), Trisomy 18 (Edward's Syndrome), Monosomy X (Turner's Syndrome) and other sex chromosome aneuploidies such as Klienfelter's Syndrome (XXY). Here, “house-keeping” genes encoded by the chromosome in question are present in different quantity and the different amount of an amplified fragment compared to the amount in a normal chromosomal configuration can be determined by mass spectrometry.

[0295] Further, there is growing evidence that certain DNA sequences may predispose an individual to any of a number of diseases such as diabetes, arteriosclerosis, obesity, various autoimmune diseases and cancer (e.g., colorectal, breast, ovarian, lung). Also, the detection of “DNA fingerprints”, e.g., polymorphisms, such as “mini- and micro-satellite sequences”, are useful for determining identity or heredity (e.g., paternity or maternity).

[0296] The following Examples 4 and 12 provide mass spectrometer based methods for identifying any of the three different isoforms of human apolipoprotein E, which are coded by the E2, E3 and E4 alleles. For example, the molecular weights of DNA fragments obtained after restriction with appropriate restriction endonucleases can be used to detect the presence of a mutation and/or a specific allele.

[0297] Depending on the biological sample, the diagnosis for a genetic disease, chromosomal aneuploidy or genetic predisposition can be preformed either pre- or post-natally.

[0298] Diagnosis of cancer

[0299] Preferred mass spectrometer-based methods for providing an early indication of the existence of a tumor or a cancer are provide herein. For example, as described in Example 13, the telomeric repeat amplification protocol (TRAP) in conjunction with telomerase specific extension of a substrate primer and a subsequent amplification of the telomerase specific extension products by an amplification step using a second primer complementary to the repeat structure was used to obtain extension ladders, that were easily detected by MALDI-TOF mass spectrometry as an indication of telomerase activity and therefor tumorigenesis.

[0300] Alternatively, as described in Example 14, expression of a tumor or cancer associated gene (e.g., human tyrosine 5-hydroxylase) via RT-PCR and analysis of the amplified products by mass spectrometry can be used to detect the tumor or cancer (e.g., biosynthesis of catecholamine via tyrosine 5-hydroxylase is a characteristic of neuroblastoma).

[0301] Further, a primer oligo base extension reaction and detection of products by mass spectrometry provides a rapid means for detecting the presence of oncogenes, such as the RET proto oncogene codon 634, which is related to causing multiple endocrine neoplasia, type II (MEN II), as described in Example 15.

[0302] Diagnosis of infection

[0303] Viruses, bacteria, fungi and other infectious organisms contain distinct nucleic acid sequences, which are different from the sequences contained in the host cell. Detecting or quantitating nucleic acid sequences that are specific to the infectious organism is important for diagnosing or monitoring infection. Examples of disease causing viruses that infect humans and animals and which may be detected by the disclosed processes include: Retroviridae (e.g., human immunodeficiency viruses, such as HIV-1 (also referred to as HTLV-III, LAV or HTLV-III/LAV, see, e.g., Ratner et al. (1985) Nature 313: 227-284; Wain-Hobson et al. (1985) Cell 40:9-17); HIV-2 (see, Guyader et al. (1987) Nature 328:662-669 European Patent Publication No. 0 269 520; Chakrabarti et al. (1987) Nature 328:543-547; and European Patent Application No. 0 655 501); and other isolates, such as HIV-LP (International PCT application No. WO 94/00562 entitled “A Novel Human Immunodeficiency Virus”; Picornaviridae (e.g., polio viruses, hepatitis A virus, (see, e.g., Gust et al. (1983) Intervirology 20:1-7); entero viruses, human coxsackie viruses, rhinoviruses, echoviruses); Calciviridae (e.g., strains that cause gastroenteritis); Togaviridae (e.g., equine encephalitis viruses, rubella viruses); Flaviridae (e.g., dengue viruses, encephalitis viruses, yellow fever viruses); Coronaviridae (e.g., coronaviruses); Rhabdoviridae (e.g., vesicular stomatitis viruses, rabies viruses); Filoviridae (e.g., ebola viruses); Paramyxoviridae (e.g., parainfluenza viruses, mumps virus, measles virus, respiratory syncytial virus); Orthomyxoviridae (e.g., influenza viruses); Bungaviridae (e.g., Hantaan viruses, bunga viruses, phleboviruses and Nairo viruses); Arena viridae (hemorrhagic fever viruses); Reoviridae (e.g., reoviruses, orbiviruses and rotaviruses); Birnaviridae; Hepadnaviridae (Hepatitis B virus); Parvoviridae (parvoviruses); Papovaviridae (papilloma viruses, polyoma viruses); Adenoviridae (most adenoviruses); Herpesviridae (herpes simplex virus (HSV) 1 and 2, varicella zoster virus, cytomegaovirus (CMV), herpes viruses'); Poxviridae (variola viruses, vaccinia viruses, pox viruses); and Iridoviridae (e.g., African swine fever virus); and unclassified viruses (e.g., the etiological agents of Spongiform encephalopathies, the agent of delta hepatitis (thought to be a defective satellite of hepatitis B virus), the agents of non-A, non-B hepatitis (class 1=internally transmitted; class 2=parenterally transmitted (i.e., Hepatitis C); Norwalk and related viruses, and astroviruses).

[0304] Examples of infectious bacteria include, but are not limited to: Helicobacter pyloris, Borelia burgdorferi, Legionella pneumophilia, Mycobacteria sps (e.g., M. tuberculosis, M. avium, M. intracellulare, M. kansaii, M. gordonae), Staphylococcus aureus, Neisseria gonorrhoeae, Neisseria meningitidis, Listeria monocytogenes, Streptococcus pyogenes (Group A Streptococcus), Streptococcus agalactiae (Group B Streptococcus), Streptococcus (viridans group), Streptococcus faecalis, Streptococcus bovis, Streptococcus (anaerobic sps.), Streptococcus phhenumoniae, pathogenic Campylobacter sp., Enterococcus sp., Haemophilus influenzae, Bacillus antracis, corynebacterium diphtheriae, corynebacterium sp., Erysipelothrix rhusiopathiae, Clostridium perfringers, Clostridium tetani, Enterobacter aerogenes, Klebsiella pneumoniae, Pasturella multocida, Bacteroides sp., Fusobacterium nucleatum, Streptobacillus moniliformis, Treponema pallidium, Treponema pertenue, Leptospira, and Actinomyces israelli.

[0305] Examples of infectious fungi include: Cryptococcus neoformans, Histoplasma capsulatum, Coccidioides immitis, Blastomyces dermatitidis, Chlamydia trachomatis, Candida albicans. Other infectious organisms (i.e., protists) include: Plasmodium falciparum and Toxoplasma gondii.

[0306] The processes provided herein makes use of the known sequence information of the target sequence and known mutation sites. Although new mutations can also be detected. For example, as shown in FIG. 8, transcription of a nucleic acid molecule obtained from a biological sample can be specifically digested using one or more nucleases and the fragments captured on a solid support carrying the corresponding complementary nucleic acid sequences. Detection of hybridization and the molecular weights of the captured target sequences provide information on whether and where in a gene a mutation is present. Alternatively, DNA can be cleaved by one or more specific endonucleases to form a mixture of fragments. Comparison of the molecular weights between wildtype and mutant fragment mixtures results in mutation detection.

[0307] Sequencing by generation of specifically terminated fragements

[0308] In another embodiment, an accurate sequence determination of a relatively large target nucleic acid, can be obtained by generating specifically terminated fragments from the target nucleic acid, determining the mass of each fragment by mass spectrometry and ordering the fragments to determine the sequence of the larger target nucleic acid. In a preferred embodiment, the specifically terminated fragments are partial or complete base-specifically terminated fragments.

[0309] One method for generating base specifically terminated fragments involves using a base-specific ribonuclease after e.g., a transcription reaction. Preferred base-specific ribonucleases are selected from among: T₁-ribonuclease (G-specific), U₂-ribonuclease (A-specific), PhyM-ribonuclease U specific and ribonuclease A (U/C specific). Other efficient and base-specific ribonucleases can be identified using the assay described in Example 16. Preferably modified nucleotides are included in the transcription reaction with unmodified nucleotides. Most preferably, the modified nucleotides and unmodified nucleotides are added to the transcription reaction at appropriate concentrations, so that both moieties are incorporated at a preferential rate of about 1:1. Alternatively, two separate transcriptions of the target DNA sequence one with the modified and one with the unmodified nucleotides can be performed and the results compared. Preferred modified nucleotides include: boron or bromine modified nucleotides (Porter et al. (1995) Biochemistry 34:11963-11969; Hasan et al. (1996) Nucl. Acids Res. 24:2150-2157; Li et al. (1995) Nucleic Acids Res. 23:4495-4501), α-thio-modified nucleotides, as well as mass-modified nucleotides as described above.

[0310] Another method for generating base specifically terminated fragments involves performing a combined amplification and base-specific termination reaction. For example, a combined amplification and termination reaction can be performed using at least two different polymerase enzymes, each having a different affinity for the chain terminating nucleotide, so that polymerization by an enzyme with relatively low affinity for the chain terminating nucleotide leads to exponential amplification whereas an enzyme with relatively high affinity for the chain terminating nucleotide terminates the polymerization and yields sequencing products.

[0311] The combined amplification and sequencing can be based on any amplification procedure that employs an enzyme with polynucleotide synthetic ability (e.g., polymerase). One preferred process, based on the polymerase chain reaction (PCR), includes the following three thermal steps: 1) denaturing a double stranded (ds) DNA molecule at an appropriate temperature and for an appropriate period of time to obtain the two single stranded (ss) DNA molecules (the template: sense and antisense strand); 2) contacting the template with at least one primer that hybridizes to at least one ss DNA template at an appropriate temperature and for an appropriate period of time to obtain a primer containing ss DNA template; 3) contacting the primer containing template at an appropriate temperature and for an appropriate period of time with: (i) a complete set of chain elongating nucleotides, (ii) at least one chain terminating nucleotide, (iii) a first DNA polymerase, which has a relatively low affinity towards the chain terminating nucleotide; and (iv) a second DNA polymerase, which has a relatively high affinity towards the chain terminating nucleotide.

[0312] Steps 1)-3) can be sequentially performed for an appropriate number of times (cycles) to obtain the desired amount of amplified sequencing ladders. The quantity of the base specifically terminated fragment desired dictates how many cycles are performed. Although an increased number of cycles results in an increased level of amplification, it may also detract from the sensitivity of a subsequent detection. It is therefore generally undesirable to perform more than about 50 cycles, and is more preferable to perform less than about 40 cycles (e.g., about 20-30 cycles).

[0313] Another preferred process for simultaneously amplifying and chain terminating a nucleic acid sequence is based on strand displacement amplification (SDA) (see, e.g., Walker et al. (1994) Nucl. Acids Res. 22:2670-77; European Patent Publication Number 0 684 315 entitled “Strand Displacement Amplification Using Thermophilic Enzymes”). In essence, this process involves the following three steps, which altogether constitute a cycle: 1) denaturing a double stranded (ds) DNA molecule containing the sequence to be amplified at an appropriate temperature and for an appropriate period of time to obtain the two single stranded (ss) DNA molecules (the template: sense and antisense strand); 2) contacting the template with at least one primer (P), that contains a recognition/cleavage site for a restriction endonuclease (RE) and that hybridizes to at least one ss DNA template at an appropriate temperature and for an appropriate period of time to obtain a primer containing ss DNA template; 3) contacting the primer containing template at an appropriate temperature and for an appropriate period of time with (i) a complete set of chain elongating nucleotides; (ii) at least one chain terminating nucleotide; (iii) a first DNA polymerase, which has a relatively low affinity towards the chain terminating nucleotide; (iv) a second DNA polymerase, which has a relatively high affinity towards the chain terminating nucleotide; and (v) an RE that nicks the primer recognition/cleavage site.

[0314] Steps 1)-3) can be sequentially performed for an appropriate number of times (cycles) to obtain the desired amount of amplified sequencing ladders. As with the PCR based process, the quantity of the base specifically terminated fragment desired dictates how many cycles are performed. Preferably, less than 50 cycles, more preferably less than about 40 cycles and most preferably about 20 to 30 cycles are performed.

[0315] Preferably about 0.5 to about 3 units of polymerase is used in the combined amplification and chain termination reaction. Most preferably about 1 to 2 units is used. Particularly preferred polymerases for use in conjunction with PCR or other thermal amplification process are thermostable polymerases, Pyrococcus species GB-D DNA polymerase, such as DEEP VENTR DNA Polymerase, Thermococcus litoralis DNA polymerase, such as VENT_(R) ^(R) DNA Polymerase-, Taq DNA polymerase (Boehringer Mannheim), AmpliTaq FS DNA polymerase (Perkin-Elmer), Deep Vent (exo-), Vent, Vent (exo-) and Deep Vent DNA polymerases (New England Biolabs), Thermo Sequenase (Amersham) or exo(-) Pseudococcus furiosus (Pfu) DNA polymerase (Stratagene, Heidelberg, Germany), AmpliTaq, Ultman, 9 degree Nm, Tth, Hot Tub, and Pyrococcus furiosus. In addition, preferably the polymerase does not have 5′-3′ exonuclease activity.

[0316] In addition to polymerases, which have a relatively high and a relatively low affinity to the chain terminating nucleotide, a third polymerase, which has proofreading capacity (e.g., Pyrococcus woesei (Pwo)) DNA polymerase may also be added to the amplification mixture to enhance the fidelity of amplification.

[0317] Yet another method for generating base specifically terminated fragments involves contacting an appropriate amount of the target nucleic acid with a specific endonuclease or exonuclease. Preferably, the original 5′ and/or 3′ end of the nucleic acid is tagged to facilitate the ordering of fragments. Tagging of the 3′ end is particularly preferred when in vitro nucleic acid transcripts are being analyzed, so that the influence of 3′ heterogeneity, premature termination and nonspecific elongation can be minimized. 5′ and 3′ tags can be natural (e.g., a 3′ poly A tail or 5′ or 3′ heterogeneity) or artificial. Preferred 5′ and/or 3′ tags are selected from among the molecules described for mass-modification above.

[0318] The methods provided herein are further illustrated by the following examples, which should not be construed as limiting in any way.

EXAMPLE 1

[0319] MALDI-TOF desorption of oligonucleotides directly on solid supports

[0320] 1 g CPG (Controlled Pore Glass) was functionalized with 3-(triethoxysilyl)-epoxypropan to form OH-groups on the polymer surface. A standard oligonucleotide synthesis with 13 mg of the OH-CPG on a DNA synthesizer (Milligen, Model 7500) employing β-cyanoethyl-phosphoamidites (Köster et al. (1994) Nucleic Acids Res. 12:4539) and TAC N-protecting groups (Köster et al. (1981) Tetrahedron 37:362) was performed to synthesize a 3′-T₅-50mer oligonucleotide sequence in which 50 nucleotides are complementary to a “hypothetical” 50mer sequence. T₅ serves as a spacer. Deprotection with saturated ammonia in methanol at room temperature for 2 hours furnished according to the determination of the DMT group CPG which contained about 10 umol 55mer/g CPG. This 55mer served as a template for hybridizations with a 26-mer (with 5′-DMT group) and a 40-mer (without DMT group). The reaction volume is 100 μl and contains about 1 nmol CPG bound 55mer as template, an equimolar amount of oligonucleotide in solution (26-mer or 40-mer) in 20 mM Tris-HCl, pH 7.5, 10 mM MgCl₂ and 25 mM NaCl. The mixture was heated for 10 min at 65° C. and cooled to 37° C. during 30′(annealing). The oligonucleotide which has not been hybridized to the polymer-bound template were removed by centrifugation and three subsequent washing/centrifugation steps with 1 00 ul each of ice-cold 50 mM ammoniumcitrate. The beads were air-dried and mixed with matrix solution (3-hydroxypicolinic acid/10 mM ammonium citrate in acetonitrile/water, 1:1), and analyzed by MALDI-TOF mass spectrometry. The results are presented in FIGS. 10 and 11.

EXAMPLE 2

[0321] Electrospray (ES) desorption and differentiation of an 18-mer and 19-mer

[0322] DNA fragments at a concentration of 50 pmole/ul in 2-propanol/10 mM ammoniumcarbonate (1/9, v/v) were analyzed simultaneously by an electrospray mass spectrometer.

[0323] The successful desorption and differentiation of an 18-mer and 19-mer by electrospray mass spectrometry is shown in FIG. 12.

EXAMPLE 3

[0324] Detection of The Cystic Fibrosis Mutation ΔF508, by single step dideoxy extension and analysis by MALDI-TOF mass spectrometry (Competitive Oligonucleotide Simple Base Extension=COSBE)

[0325] The principle of the COSBE method is shown in FIG. 13, N being the normal and M the mutation detection primer, respectively.

[0326] MATERIALS AND METHODS

[0327] PCR Amplification and Strand Immobilization. Amplification was carried out with exon 10 specific primers using standard PCR conditions (30 cycles: 1′@95° C., 1′@55° C., 2@72° C.); the reverse primer was 5′ labelled with biotin and column purified (Oligopurification Cartridge, Cruachem). After amplification the amplified products were purified by column separation (Qiagen Quickspin) and immobilized on streptavidin coated magnetic beads (Dynabeads, Dynal, Norway) according to their standard protocol; DNA was denatured using 0.1 M NaOH and washed with 0.1M NaOH,1×B+W buffer and TE buffer to remove the non-biotinylated sense strand.

[0328] COSBE Conditions. The beads containing ligated antisense strand were resuspended in 1 8 μl of Reaction mix 1 (2 μl 10×Taq buffer, 1 μL (1 unit) Taq Polymerase, 2 μL of 2 mM dGTP, and 13 μL H₂O) and incubated at 80° C. for 5′ before the addition of Reaction mix 2 (100 ng each of COSBE primers). The temperature was reduced to 60° C. and the mixtures incubated for a 5′ annealing/extension period; the beads were then washed in 25 mM triethylammonium acetate (TEAA) followed by 50 mM ammonium citrate.

[0329] Primer Sequences. All primers were synthesized on a Perseptive Biosystems Expedite 8900 DNA Synthesizer using conventional phosphoramidite chemistry (Sinha et al. (1984) Nucleic Acids Res. 12:4539). COSBE primers (each containing an intentional mismatch one base before the 3′-terminus) were those used in a previous ARMS study (Ferrie et al. (1992) Am J Hum Genet 51:251-262) with the exception that two bases were removed from the 5′-end of the normal: Ex10 PCR (Forward): 5′-BIO-GCA AGT GAA TCC TGA GCG TG-3′ (SEQ ID No. 1) Ex10 PCR (Reverse): 5′-GTG TGA AGG GTT CAT ATG C-3′ (SEQ ID No. 2) COSBE ΔF508-N 5′-ATC TAT ATT CAT CAT AGG AAA CAC CAC A-3′ (SEQ ID No. 3) (28-mer) COSBE ΔF508-N 5′-GTA TCT ATA TTC ATC ATA GGA AAC ACC ATT-3′ (SEQ ID No. 4) (30-mer)

[0330] Mass Spectrometry. After washing, beads were resuspended in 1 μL 18 Mohm/cm H₂O. 300 nL each of matrix (Wu et al. (1993) Rapid Commun. Mass Spectrom. 7:142-146) solution (0.7 M 3-hydroxypicolinic acid, 0.7 M dibasic ammonium citrate in 1:1 H₂O:CH₃CN) and resuspended beads (Tang et al. (1995) Rapid Commun Mass Spectrom 8:727-730) were mixed on a sample target and allowed to air dry. Up to 20 samples were spotted on a probe target disk for introduction into the source region of an unmodified Thermo Bioanalysis (formerly Finnigan) Visions 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular weights (Mr(calc)) were calculated from atomic compositions. Vendor provided software was used to determine peak centroids using external calibration; 1.08 Da has been subtracted from these to correct for the charge carrying proton mass to yield the text M_(r)(exp) values.

[0331] Scheme. Upon annealing to the bound template, the N and M primers (8508.6 and 9148.0 Da, respectively) are presented with dGTP; only primers with proper Watson-Crick base paring at the variable (V) position are extended by the polymerase. Thus if V pairs with the 3′-terminal base of N, N is extended to a 8837.9 Da product (N+1). Likewise, if V is properly matched to the M terminus, M is extended to a 9477.3 Da M+1 product.

[0332] Results

[0333] FIGS. 14-18 show the representative mass spectra of COSBE reaction products. Better results were obtained when amplified products were purified before the biotinylated anti-sense strand was bound.

EXAMPLE 4

[0334] Differentiation of Human Apolipoprotein E Isoforms by Mass Spectrometry

[0335] Apolipoprotein E (Apo E), a protein component of lipoproteins, plays an essential role in lipid metabolism. For example, it is involved with cholesterol transport, metabolism of lipoprotein particles, immunoregulation and activation of a number of lipolytic enzymes.

[0336] There are three common isoforms of human Apo E (coded by E2, E3 and E4 alleles). The most common is the E3 allele. The E2 allele has been shown to decrease the cholesterol level in plasma and therefore may have a protective effect against the development of atherosclerosis. The DNA encoding a portion of the E2 allele is set forth in SEQ ID No. 130. Finally, the E4 isoform has been correlated with increased levels of cholesterol, conferring predisposition to atherosclerosis. Therefore, the identity of the apo E allele of a particular individual is an important determinant of risk for the development of cardiovascular disease.

[0337] As shown in FIG. 19, a sample of DNA encoding apolipoprotein E can be obtained from a subject, amplified (e.g., via PCR); and the amplified product can be digested using an appropriate enzyme (e.g., Cfol). The restriction digest obtained can then be analyzed by a variety of means. As shown in FIG. 20, the three isotypes of apolipoprotein E (E2, E3 and E4 have different nucleic acid sequences and therefore also have distinguishable molecular weight values.

[0338] As shown in FIG. 21A-C, different Apolipoprotein E genotypes exhibit different restriction patterns in a 3.5% MetPhor Agarose Gel or 12% polyacrylamide gel. As shown in FIGS. 22 and 23, the various apolipoprotein E genotypes can also be accurately and rapidly determined by mass spectrometry.

EXAMPLE 5

[0339] Detection of hepatitis B virus in serum samples.

[0340] MATERIALS AND METHODS

[0341] Sample preparation

[0342] Phenol/choloform extraction of viral DNA and the final ethanol precipitation was done according to standard protocols.

[0343] First PCR

[0344] Each reaction was performed with 5 μl of the DNA preparation from serum. 15 pmol of each primer and 2 units Taq DNA polymerase (Perkin Elmer, Weiterstadt, Germany) were used. The final concentration of each dNTP was 200 μMM, the final volume of the reaction was 50 μl. 10×PCR buffer (Perkin Elmer, Weiterstadt, Germany) contained 100 mM Tris-HCl, pH 8.3, 500 mM KCl, 15 mM MgCl₂, 0.01% gelatine (w/v). Primer sequences: Primer SEQUENCE SEQ ID No. 1 5′-GCTTTGGGGCATGGACATTGACCCGTATAA-3′ 5 2 5′-CTGACTACTAATTCCCTGGATGCTGGGTCT-3′ 6

[0345] Nested PCR:

[0346] Each reaction was performed either with 1 μl of the first reaction or with a 1:10 dilution of the first PCR as template, respectively. 100 pmol of each primer, 2.5 u Pfu(exo-) DNA polymerase (Stratagene, Heidelberg, Germany), a final concentration of 200 pM of each dNTPs and 5 μl 10×Pfu buffer (200 mM Tris-HCl, pH 8.75, 100 mM KCl, 100 mM (NH₄)₂SO₄, 1% Triton X-100, 1 mg/ml BSA, (Stratagene, Heidelberg, Germany) were used in a final volume 50 μl. The reactions were performed in a thermocycler (OmniGene, MWG-Biotech, Ebersberg, Germany) using the following program: 92° C. for 1 minute, 60° C. for 1 minute and 72° C. for 1 minute with 20 cycles. Sequence of oligodeoxynucleotides (purchased HPLC-purified from MWG-Biotech, Ebersberg, Germany): HBV13: 5′-TTGCCTGAGTGCAGTATGGT-3′ (SEQ ID NO. 7) HBV15bio: Biotin-5′-AGCTCTATATCGGGAAGCCT-3′ (SEQ ID NO. 8)

[0347] Purification of amplified products:

[0348] For the recording of each spectrum, one PCR, 50 μl, (performed as described above) was used. Purification was done according to the following procedure: Ultrafiltration was done using Ultrafree-MC filtration units (Millipore, Eschborn, Germany) according to the protocol of the provider with centrifugation at 8000 rpm for 20 minutes. 25 μl (10 μg/μl) streptavidin Dynabeads (Dynal, Hamburg, Germany) were prepared according to the instructions of the manufacturer and resuspended in 25 μl of B/W buffer (10 mM Tris-HCl, pH 7.5, 1mM EDTA, 2 M NaCl). This suspension was added to the PCR samples still in the filtration unit and the mixture was incubated with gentle shaking for 15 minutes at ambient temperature. The suspension was transferred in a 1.5 ml Eppendorf tube and the supernatant was removed with the aid of a Magnetic Particle Collector, MPC, (Dynal, Hamburg, Germany). The beads were washed twice with 50 μl of 0.7 M ammonium citrate solution, pH 8.0 (the supernatant was removed each time using the MPC). Cleavage from the beads can be accomplished by using formamide at 90° C. The supernatant was dried in a speedvac for about an hour and resuspended in 4 μl of ultrapure water (MilliQ UF plus Millipore, Eschborn, Germany). This preparation was used for MALDI-TOF MS analysis. MALDI-TOF MS:

[0349] Half a microliter of the sample was pipetted onto the sample holder, then immediately mixed with 0.5 μl matrix solution (0.7 M3-hydroxypicolinic acid 50% acetonitrile, 70 mM ammonium citrate). This mixture was dried at ambient temperature and introduced into the mass spectrometer. All spectra were taken in positive ion mode using a Finnigan MAT Vision 2000 (Finnigan MAT, Bremen, Germany), equipped with a reflectron (5 keV ion source, 20 keV postacceleration) and a 337 nm nitrogen laser. Calibration was done with a mixture of a 40-mer and a 100-mer. Each sample was measured with different laser energies. In the negative samples, the amplified product was detected neither with less nor with higher laser energies. In the positive samples the amplified product was detected at different places of the sample spot and also with varying laser energies.

[0350] RESULTS

[0351] A nested PCR system was used for the detection of HBV DNA in blood samples employing oligonucleotides complementary to the c region of the HBV genome (primer 1: beginning at map position 1 763, primer 2 beginning at map position 2032 of the complementary strand) encoding the HBV core antigen (HBVcAg). DNA was isolated from patients serum according to standard protocols. A first PCR was performed with the DNA from these preparations using a first set of primers. If HBV DNA was present in the sample a DNA fragment of 269 bp was generated.

[0352] In the second reaction, primers which were complementary to a region within the PCR fragment generated in the first PCR were used. If HBV related amplified products were present in the first PCR a DNA fragment of 67 bp was generated (see FIG. 25A) in this nested PCR. The usage of a nested PCR system for detection provides a high sensitivity and also serves as a specificity control for the external PCR (Rolfs et al. (1992) PCR: Clinical Diagnostics and Research, Springer, Heidelberg). A further advantage is that the amount of fragments generated in the second PCR is high enough to ensure an unproblematic detection although purification losses can not be avoided.

[0353] The samples were purified using ultrafiltration to restreptavidin Dynabeads. This purification was done because the shorter primer fragments were immobilized in higher yield on the beads due to stearic reasons. The immobilization was done directly on the ultrafiltration membrane to avoid substance losses due to unspecific absorption on the membrane. Following immobilization, the beads were washed with ammonium citrate to perform cation exchange (Pieles et al. (1993) Nucl. Acids Res. 21:3191-3196). The immobilized DNA was cleaved from the beads using 25% ammonia which allows cleavage of DNA from the beads in a very short time, but does not result in an introduction of sodium or other cations.

[0354] The nested PCRs and the MALDI TOF analysis were performed without knowing the results of serological analysis. Due to the unknown virus titer, each sample of the first PCR was used undiluted as template and in a 1:10 dilution, respectively.

[0355] Sample 1 was collected from a patient with chronic active HBV infection who was positive in Hbs- and Hbe-antigen tests but negative in a dot blot analysis. Sample 2 was a serum sample from a patient with an active HBV infection and a massive viremia who was HBV positive in a dot blot analysis. Sample 3 was a denatured serum sample therefore no serological analysis could be performed by an increased level of transaminases indicating liver disease was detected. In autoradiograph analysis (FIG. 24), the first PCR of this sample was negative. Nevertheless, there was some evidence of HBV infection. This sample is of interest for MALDI-TOF analysis, because it demonstrates that even low-level amounts of amplified products can be detected after the purification procedure. Sample 4 was from a patient who was cured of HBV infection. Samples 5 and 6 were collected from patients with a chronic active HBV infection.

[0356]FIG. 24 shows the results of a PAGE analysis of the nested PCR reaction. A amplified product is clearly revealed in samples 1, 2, 3, 5 and 6. In sample 4 no amplified product was generated, it is indeed HBV negative, according to the serological analysis. Negative and positive controls are indicated by + and −, respectively. Amplification artifacts are visible in lanes 2, 5, 6 and + if non-diluted template was used. These artifacts were not generated if the template was used in a 1:10 dilution. In sample 3, amplified product was merely detectable if the template was not diluted. The results of PAGE analysis are in agreement with the data obtained by serological analysis except for sample 3 as discussed above.

[0357]FIG. 25A shows a mass spectrum of a nested amplified product from sample number 1 generated and purified as described above. The signal at 20754 Da represents the single stranded amplified product (calculated: 20735 Da, as the average mass of both strands of the amplified product cleaved from the beads). The mass difference of calculated and obtained mass is 19 Da (0.09%). As shown in FIG. 25A, sample number 1 generated a high amount of amplified product, resulting in an unambiguous detection.

[0358]FIG. 25B shows a spectrum obtained from sample number 3. As depicted in FIG. 24, the amount of amplified product generated in this section is significantly lower than that from sample number 1. Nevertheless, the amplified product is clearly revealed with a mass of 20751 Da (calculated 20735). The mass difference is 16 Da (0.08%). The spectrum depicted in FIG. 25C was obtained from sample number 4 which is HBV negative (as is also shown in FIG. 24). As expected no signals corresponding to the amplified product could be detected. All samples shown in FIG. 25 were analyzed with MALDI-TOF MS, whereby amplified product was detected in all HBV positive samples, but not in the HBV negative samples. These results were reproduced in several independent experiments.

EXAMPLE 6

[0359] Analysis of Ligase Chain Reaction Products Via MALDI-TOF Mass Spectrometry

[0360] MATERIALS AND METHODS

[0361] Oligodeoxynucleotides

[0362] Except the biotinylated one and all other oligonucleotides were synthesized in a 0.2 μmol scale on a MilliGen 7500 DNA Synthesizer (Millipore, Bedford, Mass., USA) using the β-cyanoethylphosphoamidite method (Sinha, N. D. et al. (1984) Nucleic Acids Res. 12:4539-4577). The oligodeoxynucleotides were RP-HPLC-purified and deprotected according to standard protocols. The biotinylated oligodeoxynucleotide was purchased (HPLC-purified) from Biometra, Gottingen, Germany). Sequences and calculated masses of the oligonucleotides used: Oligodeoxy- SEQ ID nucleotide SEQUENCE No. A 5′-p-TTGTGCCACGCGGTTGGGAATGTA (7521 Da)  9 B 5′-p-AGCAACGACTGTTTGCCCGCCAGTTG (7948 Da) 10 C 5′-bio-TACATTCCCAACCGGGTGGCACAAC (7960 Da) 11 D 5′-p-AACTGGCGGGCAAACAGTCGTTGCT (7708 Da) 12

[0363] 5-Phosphorylation of oligonucleotides A and D

[0364] This was performed with polynucleotide kinase (Boehringer, Mannheim, Germany) according to published procedures, the 5′-phosphorylated oligonucleotides were used unpurified for LCR.

[0365] Ligase chain reaction

[0366] The LCR was performed with Pfu DNA ligase and a ligase chain reaction kit (Stratagene, Heidelberg, Germany) containing two different pBluescript KII phagemids. One carrying the wildtype form of the E. coli lacl gene and the other one a mutant of this gene with a single point mutation at bp 191 of the lacl gene.

[0367] The following LCR conditions were used for each reaction: 100 pg template DNA (0.74 fmol) with 500 pg sonified salmon sperm DNA as carrier, 25 ng (3.3 pmol) of each 5′-phosphorylated oligonucleotide, 20 ng (2.5 pmol) of each non-phosphorylated oligonucleotide, 4 U Pfu DNA ligase in a final volume of 20 μl buffered ss 50-mer was used (I fmol) as template, in this case oligo C was also biotinylated. All reactions were performed in a thermocycler (OmniGene, MWG-Biotech, Ebersberg, Germany) with the following program: 4 minutes 92° C., 2 minutes 60° C. and 25 cycles of 20 seconds 92° C., 40 seconds 60° C. Except for HPLC analysis the biotinylated ligation educt C was used. In a control experiment the biotinylated and non-biotinylated oligonucleotides revealed the same gel electrophoretic results. The reactions were analyzed on 7.5% polyacrylamide gels. Ligation product 1 (oligo A and B) calculated mass: 15450 Da, ligation product 2 (oligo C and D) calculated mass: 15387 Da.

[0368] SMART-HPLC

[0369] Ion exchange HPLC (IE HPLC) was performed on the SMART-system (Pharmacia, Freiburg, Germany) using a Pharmacia Mono Q, PC 1.6/5 column. Eluents were buffer A (25 mM Tris-HCl, 1 mM EDTA and 0.3 M NaCl at pH 8.0) and buffer B (same as A, but 1 M NaCl). Starting with 100% A for 5 minutes at a flow rate of 50 μl/min. a gradient was applied from 0 to 70% B in 30 minutes, then increased to 100% B in 2 minutes and held at 100% B for 5 minutes. Two pooled LCR volumes (40 μl) performed with either wildtype or mutant template were injected.

[0370] Sample preparation for MALDI-TOF-MS

[0371] Preparation of immobilized DNA: For the recording of each spectrum two LCRs (performed as described above) were pooled and diluted 1:1 with 2×B/W buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 2 M NaCl). To the samples 5 μl streptavidin DynaBeads (Dynal, Hamburg, Germany) were added, the mixture was allowed to bind with gentle shaking for 15 minutes at ambient temperature. The supernatant was removed using a Magnetic Particle Collector, MPC, (Dynal, Hamburg, Germany) and the beads were washed twice with 50 μl of 0.7 M ammonium citrate solution (pH 8.0) (the supernatant was removed each time using the MPC). The beads were resuspended in 1 μl of ultrapure water (MilliQ, Millipore, Bedford, Mabelow).

[0372] Combination of ultrafiltration and streptavidin DynaBeads: For the recording of spectrum two LCRs (performed as described above) were pooled, diluted 1:1 with 2×B/W buffer and concentrated with a 5000 NMWL Ultrafree-MC filter unit (Millipore, Eschborn, Germany) according to the instructions of the manufacturer. After concentration the samples were washed with 300 μl 1×B/W buffer to streptavidin DynaBeads were added. The beads were washed once on the Ultrafree-MC filtration unit with 300 μl of 1×B/W buffer and processed as described above. The beads were resuspended in 30 to 50 μl of 1×B/W buffer and transferred in a 1.5 ml Eppendorf tube. The supernatant was removed and the beads were washed twice with 50 μl of 0.7 M ammonium citrate (pH 8.0). Finally, the beads were washed once with 30 μl of acetone and resuspended in 1 μl of ultrapure water. The ligation mixture after immobilization on the beads was used for MALDS-TOF-MS analysis as described below.

[0373] MALDI-TOF-MS

[0374] A suspension of streptavidin-coated magnetic beads with the immobilized DNA was pipetted onto the sample holder, then immediately mixed with 0.5 μl matrix solution (0.7 M 3-hydroxypicolinic acid in 50% acetonitrile, 70 mM ammonium citrate). This mixture was dried at ambient temperature and introduced into the mass spectrometer. All spectra were taken in positive ion mode using a Finnigan MAT Vision 2000 (Finnigan MAT, Bremen, Germany), equipped with a reflectron (5 keV ion source, 20 keV postacceleration) and a nitrogen laser (337 nm). For the analysis of Pfu DNA ligase 0.5 μl of the solution was mixed on the sample holder with 1 μl of matrix solution and prepared as described above. For the analysis of unpurified LCRs 1 μl of an LCR was mixed with 1 μl matrix solution.

[0375] RESULTS

[0376] The E. coli lacl gene served as a simple model system to investigate the suitability of MALDI-TOF-MS as detection method for products generated in ligase chain reactions. This template system contains of an E. coli lacl wildtype gene in a pBluescript KII phagemid and an E. coli lacl gene carrying a single point mutation at bp 191 (C to T transition; SEQ ID No. 131) in the same phagemid. Four different oligonucleotides were used, which were ligated only if the E coli lacl wildtype gene was present (FIG. 26).

[0377] LCR conditions were optimized using Pfu DNA ligase to obtain at least 1 pmol ligation product in each positive reaction. The ligation reactions were analyzed by polyacrylamide gel electrophoresis (PAGE) and HPLC on the SMART system (FIGS. 27, 28 and 29). FIG. 27 shows a PAGE of a positive LCR with wildtype template (lane 1), a negative LCR with mutant template (1 and 2) and a negative control which contains enzyme, oligonucleotides and no template but salmon sperm DNA. The gel electrophoresis clearly shows that the ligation product (50 bp) was produced only in the reaction with wildtype template; whereas neither the template carrying the point mutation nor the control reaction with salmon sperm DNA generated amplification products. In FIG. 28, HPLC was used to analyze two pooled LCRs with wildtype template performed under the same conditions. The ligation product was clearly revealed. FIG. 29 shows the results of a HPLC in which two pooled negative LCRs with mutant template were analyzed. These chromatograms confirm the data shown in FIG. 27 and the results taken together clearly demonstrate, that the system generates ligation products in a significant amount only if the wildtype template is provided.

[0378] Appropriate control runs were performed to determine retention times of the different compounds involved in the L CR experiments. These include the four oligonucleotides (A, B, C, and D), a synthetic ds 50-mer (with the same sequence as the ligation product), the wildtype template DNA, sonicated salmon sperm DNA and the Pfu DNA ligase in ligation buffer.

[0379] In order to test which purification procedure should be used before a LCR reaction can be analyzed by MALDI-TOF-MS, aliquots of an unpurified LCR (FIG. 30A) and aliquots of the enzyme stock solution (FIG. 30B) were analyzed with MALDI-TOF-MS. It turned out that appropriate sample preparation is absolutely necessary since all signals in the unpurified LCR correspond to signals obtained in the MALDI-TOF-MS analysis of the Pfu DNA ligase. The calculated mass values of oligo A and the ligation product are 7521 Da and 15450 Da, respectively. The data in FIG. 30 show that the enzyme solution leads to mass signals which do interfere with the expected signals of the ligation educts and products and therefore makes an unambiguous signal assignment impossible. Furthermore, the spectra showed signals of the detergent Tween20 being part of the enzyme storage buffer which influences the crystallization behavior of the analyte/matrix mixture in an unfavorable way.

[0380] In one purification format streptavidin-coated magnetic beads were used. As was shown in a recent paper, the direct desorption of DNA immobilized by Watson-Crick base pairing to a complementary DNA fragment covalently bound to the beads is possible and the non-biotinylated strand will be desorbed exclusively (Tang et al. (1995) Nucleic Acids Res. 23:3126-3131). This approach in using immobilized ds DNA ensures that only the non-biotinylated strand will be desorbed. If non-immobilized ds DNA is analyzed both strands are desorbed (Tang et al. (1994) Rapid Comm. Mass Spectrom. 7 183-186) leading to broad signals depending on the mass difference of the two single strands. Therefore, employing this system for LCR only the non-ligated oligonucleotide A, with a calculated mass of 7521 Da, and the ligation product from oligo A and oligo B (calculated mass: 15450 Da) will be desorbed if oligo C is biotinylated at the 5′-end and immobilized on steptavidin-coated beads. This results in a simple and unambiguous identification of the LCR educts and products.

[0381]FIG. 31A shows a MALDI-TOF mass spectrum obtained from two pooled LCRs (performed as described above) purified on streptavidin DynaBeads and desorbed directly from the beads showed that the purification method used was efficient (compared with FIG. 30). A signal which represents the unligated oligo A and a signal which corresponds to the ligation product could be detected. The agreement between the calculated and the experimentally found mass values is remarkable and allows an unambiguous peak assignment and accurate detection of the ligation product. In contrast, no ligation product but only oligo A could be detected in the spectrum obtained from two pooled LCRs with mutated template (FIG. 31B). The specificity and selectivity of the LCR conditions and the sensitivity of the MALDI-TOF detection is further demonstrated when performing the ligation reaction in the absence of a specific template. FIG. 32 shows a spectrum obtained from two pooled LCRs in which only salmon sperm DNA was used as a negative control, only oligo A could be detected, as expected.

[0382] While the results shown in FIG. 31A can be correlated to lane 1 of the gel in FIG. 27, the spectrum shown in FIG. 31B is equivalent to lane 2 in FIG. 27, and finally also the spectrum in FIG. 32 corresponds to lane 3 in FIG. 27. The results are in congruence with the HPLC analysis presented in FIGS. 28 and 29. While gel electrophoresis (FIG. 27) and HPLC (FIGS. 28 and 29) reveal either an excess or almost equal amounts of ligation product over ligation educts, the analysis by MALDI-TOF mass spectrometry produces a smaller signal for the ligation product (FIG. 31A).

[0383] The lower intensity of the ligation product signal could be due to different desorption/ionization efficiencies between 24- and a 50-mer. Since the T_(m) value of a duplex with 50 compared to 24 base pairs is significantly higher, more 24-mer could be desorbed. A reduction in signal intensity can also result from a higher degree of fragmentation in case of the longer oligonucleotides.

[0384] Regardless of the purification with streptavidin DynaBeads, FIG. 32 reveals traces of Tween20 in the region around 2000 Da. Substances with a viscous consistence, negatively influence the process of crystallization and therefore can be detrimental to mass spectrometer analysis. Tween20 and also glycerol which are part of enzyme storage buffers therefore should be removed entirely prior to mass spectrometer analysis. For this reason an improved purification procedure which includes an additional ultrafiltration step prior to treatment with DynaBeads was investigated. Indeed, this sample purification resulted in a significant improvement of MALDI-TOF mass spectrometric performance.

[0385]FIG. 33 shows spectra obtained from two pooled positive (FIG. 33A) and negative (FIG. 33B) LCRs, respectively. The positive reaction was performed with a chemically synthesized, single strand 50mer as template with a sequence equivalent to the ligation product of oligo C and D. Oligo C was 5′-biotinylated. Therefore the template was not detected. As expected, only the ligation product of Oligo A and B (calculated mass 15450 Da) could be desorbed from the immobilized and ligated oligo C and D. This newly generated DNA fragment is represented by the mass signal of 15448 Da in FIG. 33A. Compared to FIG. 32A, this spectrum clearly shows that this method of sample preparation produces signals with improved resolution and intensity.

EXAMPLE 7

[0386] Mutation detection by solid phase oligo base extension of a primer and analysis by MALDI-TOF mass spectrometry (Primer Oligo Base Extension =Probe)

[0387] Summary

[0388] The solid-phase oligo base extension method detects point mutations and small deletions as well as small insertions in amplified DNA. The method is based on the extension of a detection primer that anneals adjacent to a variable nucleotide position on an affinity-captured amplified template, using a DNA polymerase, a mixture of three dNTPs, and the missing one dideoxy nucleotide. The resulting products are evaluated and resolved by MALDI-TOF mass spectrometry without further labeling procedures. The aim of the following experiment was to determine mutant and wildtype alleles in a fast and reliable manner.

[0389] Description of the experiment

[0390] The method used a single detection primer followed by a oligonucleotide extension step to give products differing in length by some bases specific for mutant or wildtype alleles which can be easily resolved by MALDI-TOF mass spectrometry. The method is described by using as example the exon 10 of the CFTR-gene. Exon 10 of this gene bears the most common mutation in many ethnic groups (ΔF508) that leads in the homozygous state to the clinical phenotype of cystic fibrosis.

[0391] MATERIALS AND METHODS

[0392] Genomic DNA

[0393] Genomic DNA were obtained from healthy individuals, individuals homozygous or heterozygous for the ΔF508 mutation, and one individual heterozygous for the 1506S mutation. The wildtype and mutant alleles were confirmed by standard Sanger sequencing.

[0394] PCR amplification of exon 10 of the CFTR gene

[0395] The primers for PCR amplification were CFEx10-F (5-GCAAGTGAATCCTGAGCGTG-3′ (SEQ ID No. 13) located in intron 9 and biotinylated) and CFEx10-R (5′-GTGTGAAGGGCGTG-3′SEQ ID No. 14) located in intron 10). Primers were used in a concentration of 8 pmol. Taq-polymerase including 10×buffer were purchased from Boehringer-Mannheim and dTNPs were obtained from Pharmacia. The total reaction volume was 50 μl. Cycling conditions for PCR were initially 5 min. at 95° C., followed by 1 min. at 94° C., 45 sec at 53° C., and 30 sec at 72° C. for 40 cycles with a final extension time of 5 min at 72° C.

[0396] Purification of the amplified products

[0397] Amplification products were purified by using Qiagen's PCR purification kit (No. 28106) according to manufacturer's instructions. The elution of the purified products from the column was done in 50 μl TE-buffer (10 mM Tris, 1 mM EDTA, pH 7,5).

[0398] Affinity-capture and denaturation of the double stranded DNA

[0399] 10 μL aliquots of the purified amplified product were transferred to one well of a streptavidin-coated microtiter plate (No. 1645684 Boehringer-Mannheim or No. 95029262 Labsystems). Subsequently, 10 μl incubation buffer (80 mM sodium phosphate, 400 mM NaCl, 0,4% Tween20, pH 7,5) and 30 μl water were added. After incubation for 1 hour at room temperature the wells were washed three times with 200 μl washing buffer (40 mM Tris, 1 mM EDTA, 50 mM NaCl, 0.1% Tween 20, pH 8.8). To denature the double stranded DNA the wells were treated with 100 μl of a 50 mM NaOH solution for 3 min and the wells washed three times with 200 μl washing buffer.

[0400] Oligo base extension reaction

[0401] The annealing of 25 pmol detection primer (CF508: 5′-CTATATTCATCATAGGAAACACCA-3′ (SEQ ID No. 15) was performed in 50 μl annealing buffer (20 mM Tris, 10 mM KCl, 10 mM (NH₄)₂SO₄, 2 mM MgSO2, 1 % Triton X-100, pH 8) at 50° C. for 10 min. The wells were washed three times with 200 μl washing buffer and once in 200 μl TE buffer. The extension reaction was performed by using some components of the DNA sequencing kit from USB (No. 70770) and dNTPs or ddNTPs from Pharmacia. The total reaction volume was 45 μl, containing of 21 μl water, 6 μl Sequenase-buffer, 3 μl 10 mM DTT solution, 4,5 μl, 0,5 mM of three dNTPs, 4,5 μl, 2 mM the missing one ddNTP, 5,5,ul glycerol enzyme dilution buffer, 0,25 μl Sequenase 2.0, and 0,25 pyrophosphatase. The reaction was pipetted on ice and then incubated for 15 min at room temperature and for 5 min at 37° C. Hence, the wells were washed three times with 200 μl washing buffer and once with 60 μl of a 70 mM NH₄-Citrate solution.

[0402] Denaturation and precipitation of the extended primer

[0403] The extended primer was denatured in 50 μl 10%-DMSO (dimethylsufoxide) in water at 80° C. for 10 min. For precipitation, 10 NH₄-Acetate (pH 6.5), 0,5 μl glycogen (10 mg/ml water, Sigma No. G1765), and 100 μl absolute ethanol were added to the supernatant and incubated for 1 hour at room temperature. After centrifugation at 13.000 g for 10 min the pellet was washed in 70% ethanol and resuspended in 1 μl 18 Mohm/cm H₂O water.

[0404] Sample preparation and analysis on MALDI-TOF mass spectrometry

[0405] Sample preparation was performed by mixing 0,3 μl of each of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic ammonium citrate in 1:1 H₂O:CH₃CN) and of resuspended DNA/glycogen pellet on a sample target and allowed to air dry. Up to 20 samples were spotted on a probe target disk for introduction into the source region of an unmodified Thermo Bioanalysis (formerly Finnigan) Visions 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular mass (Mr(calc)) were calculated from atomic compositions; reported experimental Mr (Mr(exp)) values are those of the singly-protonated form, determined using external calibration.

[0406] RESULTS

[0407] The aim of the experiment was to develop a fast and reliable method independent of exact stringencies for mutation detection that leads to high quality and high throughput in the diagnosis of genetic diseases. Therefore a special kind of DNA sequencing (oligo base extension of one mutation detection primer) was combined with the evaluation of the resulting mini-sequencing products by matrix-assisted laser desorption ionization (MALDI) mass spectrometry (MS). The time-of-flight (TOF) reflectron arrangement was chosen as a possible mass measurement system. To prove this hypothesis, the examination was performed with exon 10 of the CFTR-gene, in which some mutations could lead to the clinical phenotype of cystic fibrosis, the most common monogenetic disease in the Caucasian population.

[0408] The schematic presentation as given in FIG. 34 shows the expected short sequencing products with the theoretically calculated molecular mass of the wildtype and various mutations of exon 10 of the CFTR-gene (SEQ ID No. 132). The short sequencing products were produced using either ddTTP (FIG. 34A; SEQ ID Nos. 133-135) or ddCTP (FIG. 34B; SEQ ID Nos. 136-139) to introduce a definitive sequence related stop in the nascent DNA strand. The MALDI-TOF-MS spectra of healthy, mutation heterozygous, and mutation homozygous individuals are presented in FIG. 35. All samples were confirmed by standard Sanger sequencing which showed no discrepancy in comparison to the mass spec analysis. The accuracy of the experimental measurements of the various molecular masses was within a range of minus 21.8 and plus 87.1 dalton (Da) to the range expected. This allows a definitive interpretation of the results in each case. A further advantage of this procedure is the unambiguous detection of the A1507 mutation. In the ddTTP reaction, the wildtype allele would be detected, whereas in the ddCTP reaction the three base pair deletion would be disclosed.

[0409] The method described is highly suitable for the detection of single point mutations or microlesions of DNA. Careful choice of the mutation detection primers will open the window of multiplexing and lead to a high throughput including high quality in genetic diagnosis without any need for exact stringencies necessary in comparable allele-specific procedures. Because of the uniqueness of the genetic information, the oligo base extension of mutation detection primer is applicable in each disease gene or polymorphic region in the genome like variable number of tandem repeats (VNTR) or other single nucleotide polymorphisms (e.g., apolipoprotein E gene), as also described here.

EXAMPLE 8 Detection of Polymerase Chain Reaction Products Containing 7-Deazapurine Moieties with Matrix-Assisted Laser Desorption/lonization Time-of-Flight (MALDI-TOF) Mass Spectrometry

[0410] MATERIALS AND METHODS

[0411] Nucleic acid amplifications

[0412] The following oligodeoxynucleotide primers were either synthesized according to standard phosphoamidite chemistry (Sinha, N. D,. et al., (1983) Tetrahedron Let. Vol. 24, Pp. 5843-5846; Sinha, N. D., et al, (1984) Nucleic Acids Res., Vol. 12, Pp. 4539-4557) on a MilliGen 7500 DNA synthesizer (Millipore, Bedford, Mass., USA) in 200 nmol scales or purchased from MWG-Biotech (Ebersberg, Germany, primer 3) and Biometra (Goettingen, Germany, primers 6-7). primer 1: 5′-GTCACCCTCGACCTGCAG (SEQ ID NO. 16); primer 2: 5′-TTGTAAAACGACGGCCAGT (SEQ ID NO. 17); primer 3: 5′-CTTCCACCGCGATGTTGA (SEQ ID NO. 18); primer 4: 5′-CAGGAAACAGCTATGAC (SEQ ID NO. 19); primer 5: 5′-GTAAAACGACGGCCAGT (SEQ ID NO. 20); primer 6: 5′-GTCACCCTCGACCTGCAgC (g: RiboG) (SEQ ID NO. 21); primer 7: 5′-GTTGTAAAACGAGGGCCAgT (g: RiboG) (SEQ ID NO. 22);

[0413] The 99-mer (SEQ ID No. 141) and 200-mer DNA strands (SEQ ID No. 140; modified and unmodified) as well as the ribo- and 7-deaza-modified 100-mer were amplified from pRFc1 DNA (10 ng, generously supplied by S. Feyerabend, University of Hamburg) in 100 μL reaction volume containing 10 mmol/L KCl, 10 mmol/L (NH₄)₂SO₄, 20 mmol/L Tris HCl (pH 8.8), 2 mmol/L MgSO₄, (exo(-) Pseudococcus furiosus (Pfu) -Buffer, Pharmacia, Freiburg, Germany), 0.2 mmol/L each dNTP (Pharmacia, Freiburg, Germany), 1 μmol/L of each primer and 1 unit of exo(-)Pfu DNA polymerase (Stratagene, Heidelberg, Germany). For the 99-mer primers 1 and 2, for the 200-mer primers 1 and 3 and for the 100-mer primers 6 and 7 were used. To obtain 7-deazapurine modified nucleic acids, during PCR-amplification dATP and dGTP were replaced with 7-deaza-dATP and 7-deaza- dGTP. The reaction was performed in a thermal cycler (OmniGene, MWG-Biotech, Ebersberg, Germany) using the cycle: denaturation at 95° C. for 1 min., annealing at 51° C. for 1 min. and extension at 72° C. for 1 min. For all PCRs the number of reaction cycles was 30. The reaction was allowed to extend for additional 10 min. at 72° C. after the last cycle.

[0414] The 103-mer DNA strands (modified and unmodified; SEQ ID No. 245) were amplified from M13mp18 RFI DNA (100 ng, Pharmacia, Freiburg, Germany) in 100 μL reaction volume. using primers 4 and 5 all other concentrations were unchanged. The reaction was performed using the cycle: denaturation at 95° C. for 1 min., annealing at 40° C. for 1 min. and extension at 72° C. for 1 min. After 30 cycles for the unmodified and 40 cycles for the modified 103-mer respectively, the samples were incubated for additional 10 min. at 72° C.

[0415] Synthesis of 5′-[³²-P]-Iabeled PCR-primers

[0416] Primers 1 and 4 were 5′-[³²-P]-labeled employing T4-polynucleotidkinase (Epicentre Technologies) and (γ-³²P)-ATP. (BLU/NGG/502A, Dupont, Germany) according to the protocols of the manufacturer. The reactions were performed substituting 10% of primer 1 and 4 in PCR with the labeled primers under otherwise unchanged reaction-conditions. The amplified DNAs were separated by gel electrophoresis on a 10% polyacrylamide gel. The appropriate bands were excised and counted on a Packard TRI-CARB 460C liquid scintillation system (Packard, Conn., USA).

[0417] Primer-cleavage from ribo-modified PCR-product

[0418] The amplified DNA was purified using Ultrafree-MC filter units (30,000 NMWL), it was then redissolved in 100 μl of 0.2 mol/L NaOH and heated at 95° C. for 25 minutes. The solution was then acidified with HCl (1 mol/L) and further purified for MALDI-TOF analysis employing Ultrafree-MC filter units (10,000 NMWL) as described below.

[0419] Purification of amplified products

[0420] All samples were purified and concentrated using Ultrafree-MC units 30000 NMWL (Millipore, Eschborn, Germany) according to the manufacturer's description. After lyophilization, amplified products were redissolved in 5 μL (3 μL for the 200-mer) of ultrapure water. This analyte solution was directly used for MALDI-TOF measurements.

[0421] MALDI-TOF MS

[0422] Aliquots of 0.5 μL of analyte solution and 0.5 μL of matrix solution (0.7 mol/L 3-HPA and 0.07 mol/L ammonium citrate in acetonitrile/water (1:1, v/v)) were mixed on a flat metallic sample support. After drying at ambient temperature the sample was introduced into the mass spectrometer for analysis. The MALDI-TOF mass spectrometer used was a Finnigan MAT Vision 2000 (Finnigan MAT, Bremen, Germany). Spectra were recorded in the positive ion reflector mode with a 5 keV ion source and 20 keV postacceleration. The instrument was equipped with a nitrogen laser (337 nm wavelength). The vacuum of the system was 3-4·10⁻⁸ hPa in the analyzer region and 1-4-10⁻⁷ hPa in the source region. Spectra of modified and unmodified DNA samples were obtained with the same relative laser power; external calibration was performed with a mixture of synthetic oligodeoxynucleotides (7-to 50-mer).

[0423] RESULTS AND DISCUSSION

[0424] Enzymatic synthesis of 7-deazapurine nucleotide containing nucleic acids by PCR

[0425] In order to demonstrate the feasibility of MALDI-TOF MS for the rapid, gel-free analysis of short amplified products and to investigate the effect of 7-deazapurine modification of nucleic acids under MALDI-TOF conditions, two different primer-template systems were used to synthesize DNA fragments. Sequences are displayed in FIGS. 36 and 37. While the two single strands of the 103-mer amplified product had nearly equal masses (Δm=8 u), the two single strands of the 99-mer differed by 526 u. Considering that 7-deaza purine nucleotide building blocks for chemical DNA synthesis are approximately 160 times more expensive than regular ones (Product Information, Glen Research Corporation, Sterling, VA) and their application in standard β-cyano-phosphoamidite chemistry is not trivial (Product Information, Glen Research Corporation, Sterling, Va.; Schneider et al. (1995) Nucl. Acids Res. 23:1570) the cost of 7-deaza purine modified primers would be very high. Therefore, to increase the applicability and scope of the method, all PCRs were performed using unmodified oligonucleotide primers which are routinely available. Substituting dATP and dGTP by c⁷-dATP and c⁷-dGTP in polymerase chain reaction led to products containing approximately 80% 7-deaza-purine modified nucleosides for the 99-mer and 103-mer; and about 90% for the 200-mer, respectively. Table II shows the base composition of all PCR products. TABLE II Base composition of the 99-mer, 103-mer and 200-mer PCR amplification products (unmodified and 7-deaza purine modified) rel. DNA-fragments¹ C T A G c⁷-deaza-A c⁷-deaza-6 mod.2 200-mers 54 34 56 56 — — — modified 200-mer 54 34 6 5 50 51 90% s 200-mer a 56 56 34 54 — — — modified 200-mer 56 56 3 4 31 50 92% a 103-mer s 28 23 24 28 — — — modified 103-mer 28 23 6 5 18 23 79% s 103-mer a 28 24 23 28 — — — modified 103-mer 28 24 7 4 16 24 78% a 99-mer s 34 21 24 20 — — — modified 99-mer 34 21 6 5 18 15 75% s 99-mer a 20 24 21 34 — — — modified 99-mer 20 24 3 4 18 30 87% a

[0426] It remained to be determined whether 80-90% 7-deaza-purine modification is sufficient for accurate mass spectrometer detection. It was therefore important to determine whether all purine nucleotides could be substituted during the enzymatic amplification step. This was not trivial since it had been shown that c⁷-dATP cannot fully replace dATP in PCR if Taq DNA polymerase is employed (Seela, F. and A. Roelling (1 992) Nucleic Acids Res., 20,55-61). Fortunately it was found that exo(-)Pfu DNA polymerase indeed could accept c⁷-dATP and c⁷-dGTP in the absence of unmodified purine nucleoside triphosphates. The incorporation was less efficient leading to a lower yield of amplified product (FIG. 38).

[0427] To verify these results, the amplications with [³²P]-labeled primers were repeated. The autoradiogram (FIG. 39) clearly shows lower yields for the modified PCR-products. The bands were excised from the gel and counted. For all amplified products the yield of the modified nucleic acids was about 50%, referring to the corresponding unmodified amplification product. Further experiments showed that exo(-)DeepVent and Vent DNA polymerase were able to incorporate c⁷-dATP and c⁷-dGTP during PCR as well. The overall performance, however, turned out to be best for the exo(-)Pfu DNA polymerase giving least side products during amplification. Using all three polymerases, it was found that such PCRs employing c⁷-dATP and c⁷-dGTP instead of their isosteres showed less side-reactions giving a cleaner PCR-product. Decreased occurrence of amplification side products may be explained by a reduction of primer mismatches due to a ling template which is synthesized during PCR. Decreased melting point for DNA duplexes containing 7-deaza-purine have been described (Mizusawa, S. etal., (1986) Nucleic Acids Res., 14, 1319-1324). In addition to the three polymerases specified above (exo(-) Deep Vent DNA polymerase, 5Vent DNA polymerase and exo(-) (Pfu) DNA polymerase), it is anticipated that other polymerases, such as the Large Klenow fragment of E. coli DNA polymerase, Sequenase, Taq DNA polymerase and U AmpliTaq DNA polymerase can be used. In addition, where RNA is the template, RNA polymerases, such as the SP6 or the T7 RNA polymerase, must be used.

[0428] MALDI-TOF mass spectrometry of modified and unmodified amplified products.

[0429] The 99-mer, 103-mer and 200-mer amplified products were analyzed by MALDI-TOF MS. Based on past experience, it was known that the degree of depurination depends on the laser energy used for desorption and ionization of the analyte. Since the influence of 7-deazapurine modification on fragmentation due to depurination was to be investigated, all spectra were measured at the same relative laser energy.

[0430]FIGS. 40a and 40 b show the mass spectra of the modified and unmodified 103-mer nucleic acids. In case of the modified 103-mer, fragmentation causes a broad (M +H)⁺ signal. The maximum of the peak is shifted to lower masses so that the assigned mass represents a mean value of (M+H)⁺ signal and signals of fragmented ions, rather than the (M+H)⁺ signal itself. Although the modified 103-mer still contains about 20% A and G from the oligonucleotide primers, it shows less fragmentation which is featured by much more narrow and symmetric signals. Especially peak tailing on the lower mass side due to depurination, is substantially reduced. Hence, the difference between measured and calculated mass is strongly reduced although it is still below the expected mass. For the unmodified sample a (M+H)⁺ signal of 31 670 was observed, which is a 97 u or 0.3% difference to the calculated mass. While, in case of the modified sample this mass difference diminished to 10 u or 0.03% (31713 u found, 31723 u calculated). These observations are verified by a significant increase in mass resolution of the (M+H)+signal of the two signal strands (n/Δm=67 as opposed to 18 for the unmodified sample with Δm=full width at half maximum, fwhm). Because of the low mass difference between the two single strands (8 u) their individual signals were not resolved.

[0431] With the results of the 99 base pair DNA fragments the effects of increased mass resolution for 7-deazapurine containing DNA becomes even more evident. The two single strands in the unmodified sample were not resolved even though the mass difference between the two strands of the amplified product was very high with 526 u due to unequal distribution of purines and pyrimidines (FIG. 41a). In contrast to this, the modified DNA showed distinct peaks for the two single strands (FIG. 41b) which demonstrates the superiority of this approach for the determination of molecular weights to gel electrophoretic methods even more profound. Although base line resolution was not obtained the individual masses were able to be assigned with an accuracy of 0.1%: Δm=27 u for the lighter (calc. mass=30224 u) and Δm=14 u for the heavier strand (calc. mass=30750 u). Again, it was found that the full width at half maximum was substantially decreased for the 7-deazapurine containing sample.

[0432] In case the 99-mer and 103-mer, the 7-deazapurine containing nucleic acids seem to give higher sensitivity despite the fact that they still contain about 20% unmodified purine nucleotides. To get comparable signal-to-noise ratio at similar intensities for the (M+H)⁺ signals, the unmodified 99-mer required 20 laser shots in contrast to 12 for the modified one and the 103-mer required 12 shots for the unmodified sample as opposed to three for the 7-deazapurine nucleoside-containing amplified product.

[0433] Comparing the spectra of the modified and unmodified 200-mer amplicons, improved mass resolution was again found for the 7-deazapurine containing sample as well as increased signal intensities (FIGS. 42A and 42B). While the signal of the single strands predominates in the spectrum of the modified sample the DNA-duplex and dimers of the single strands gave the strongest signal for the unmodified sample.

[0434] A complete 7-deaza purine modification of nucleic acids may be achieved either using modified primers in PCR or cleaving the unmodified primers from the partially modified amplified product. Since disadvantages are associated with modified primers, as described above, a 100-mer was synthesized using primers with a ribo-modification. The primers were cleaved hydrolytically with NaOH according to a method developed earlier in our laboratory (Koester, H. et al., Z Physiol. Chem., 359, 1570-1589). FIGS. 43A and 43B display the spectra of the amplified product before and after primer cleavage. FIG. 43b shows that the hydrolysis was successful: The hydrolyzed amplified product as well as the two released primers could be detected together with a small signal from residual uncleaved 100-mer. This procedure is especially useful for the MALDI-TOF analysis of very short PCR-products since the share of unmodified purines originating from the primer increases with decreasing length of the amplified sequence.

[0435] The remarkable properties of 7-deazapurine modified nucleic acids can be explained by either more effective desorption and/or ionization, increased ion stability and/or a lower denaturation energy of the double stranded purine modified nucleic acid. The exchange of the N-7 for a methyl group results in the loss of one acceptor for a hydrogen bond which influences the ability of the nucleic acid to form secondary structures due to non-Watson-Crick base pairing (Seela, F. and A. Kehne (1987) Biochemistry, 26, 2232-2238.). In addition to this the aromatic system of 7-deazapurine has a lower electron density that weakens Watson-Crick base pairing resulting in a decreased melting point (Mizusawa, S. et al., (1986) Nucleic Acids Res., 14, 1319-1324) of the double-strand. This effect may decrease the energy needed for denaturation of the duplex in the MALDI process. These aspects as well as the loss of a site which probably will carry a positive charge on the N-7 nitrogen renders the 7-deazapurine modified nucleic acid less polar and may promote the effectiveness of desorption.

[0436] Because of the absence of N-7 as proton acceptor and the decreased polarization of the C-N bond in 7-deazapurine nucleosides depurination following the mechanisms established for hydrolysis in solution is prevented. Although a direct correlation of reactions in solution and in the gas phase is problematic, less fragmentation due to depurination of the modified nucleic acids can be expected in the MALDI process. Depurination may either be accompanied by loss of charge which decreases the total yield of charged species or it may produce charged fragmentation products which decreases the intensity of the non fragmented molecular ion signal.

[0437] The observation of increased sensitivity and decreased peak tailing of the (M+H)⁺ signals on the lower mass side due to decreased fragmentation of the 7-deazapurine containing samples indicate that the N-7 atom indeed is essential for the mechanism of depurination in the MALDI-TOF process. In conclusion, 7-deazapurine containing nucleic acids show distinctly increased ion-stability and sensitivity under MALDI-TOF conditions and therefore provide for higher mass accuracy and mass resolution.

EXAMPLE 9

[0438] Solid Phase Sequencing and Mass Spectrometer Detection

[0439] MATERIALS AND METHODS

[0440] Oligonucleotides were purchased from Operon Technologies (Alameda, Calif.) in an unpurified form. Sequencing reactions were performed on a solid surface using reagents from the sequencing kit for Sequenase Version 2.0 (Amersham, Arlington Heights, Ill.).

[0441] Sequencing a 39-mer target

[0442] Sequencing complex: SEQ ID SEQUENCE NO. 5′-TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA-(A^(b))_(a)-3′ 23 5′-TGTACGTCACAACT-3′ (PNA 16/DNA) 24

[0443] In order to perform solid-phase DNA sequencing, template strand DNA 1 1 683 was 3′-biotinylated by terminal deoxynucleotidyl transferase. A 30 μl reaction, containing 60 pmol of DNA11683, 1.3 nmol of biotin 14-dATP (GIBCO BRL, Grand Island, N.Y.), 30 units of terminal transferase (Amersham, Arlington Heights, Ill.), and 1×reaction buffer (supplied with enzyme), was incubated at 37° C. for 1 hour. The reaction was stopped by heat inactivation of the terminal transferase at 70° C. for 10 min. The resulting product was desalted by passing through a TE-10 spin column (Clontech). More than one molecules of biotin-14-dATP could be added to the 3′-end of DNA11683. The biotinylated DNA11683 was incubated with 0.3 mg of Dynal streptavidin beads in 30 μl 1×binding and washing buffer at ambient temperature for 30 min. The beads were washed twice with TE and redissolved in 30 μl TE, 10 μl aliquot (containing 0.1 mg of beads) was used for sequencing reactions.

[0444] The 0.1 mg beads from previous step were resuspended in a 10 μl volume containing 2 μl of 5×Sequenase buffer (200 mM Tris-HCI, pH 7.5, 100 mM MgCI₂, and 250 mM NaCI) from the Sequenase kit and 5 pmol of corresponding primer PNA 16/DNA. The annealing mixture was heated to 70° C. and allowed to cool slowly to room temperature over a 20-30 min time period. Then 1 μl 0.1 M dithiothreitol solution, 1 μl Mn buffer (0.15 M sodium isocitrate and 0.1 M MgCl₂), and 2μl of diluted Sequenase (3.25 units) were added. The reaction mixture was divided into four aliquots of 3 μl each and mixed with termination mixes (each contains of 3 μl of the appropriate termination mix: 32 μM c7dATP, 32 μM dCTP, 32 μM c7dGTP, 32 μM dTTP and 3.2 μM of one of the four ddTNPs, in 50 mM NaCl). The reaction mixtures were incubated at 37° C. for 2 min. After the completion of extension, the beads were precipitated and the supernatant was removed. The beads were washed twice and resuspended in TE and kept at 4° C.

[0445] Sequencing a 78-mer target 5′-AAGATCTGACCAGGGATTCGGTTAGCGTGACTGCTGCTGCTGCTGCT (SEQ ID NO. 25) GCTGCTGGATGATCCGACGCATCAGATCTGG-(A^(b))_(n)-3′ (TNR.PLASM2) 5′-CTGATGCGTCGGATCATC-3′(CM1) (SEQ ID NO. 26)

[0446] The target TNR.PLASM2 was biotinylated and sequenced using procedures similar to those described in previous section (sequencing a 39-mer target).

[0447] Sequencing a 15-mer target with partially duplex probe ^(5′)-F-GATGATCCGACGCATCACAGCTC^(3′) (SEQ ID No. 27) ^(5′)-TCGGTTCCAAGAGCTGTGATGCGTCGGATCATC-b-^(3′) (SEQ ID No. 28)

[0448] CM1 B3B was immobilized on Dynabeads M280 with streptavidin (Dynal, Norway) by incubating 60 pmol of CM1B3B with 0.3 magnetic beads in 30 μl 1 M NaCl and TE (1×binding and washing buffer) at room temperature for 30 min. The beads were washed twice with TE and redissolved in 30 μl TE, 10 or 20 p1 aliquot (containing 0.1 or 0.2 mg of beads respectively) was used for sequencing reactions.

[0449] The duplex was formed by annealing corresponding aliquot of beads from previous step with 10 pmol of DF11 a5F (or 20 pmol of DF11 a5F for 0.2 mg of beads) in a 9 μl volume containing 2 μl of 5×Sequenase buffer (200 mM Tris-HCI, pH 7.5, 100 mM MgCl₂, and 250 mM NaCl) from the Sequenase kit. The annealing mixture was heated to 65° C. and allowed to cool slowly to 37° C. over a 20-30 min time period. The duplex primer was then mixed with 10 pmol of TS10 (20 pmol of TS10 for 0.2 mg of beads) in 1 μl volume, and the resulting mixture was further incubated at 37° C. for 5 min, room temperature for 5-10 min. Then 1 μl 0.1 M dithiothreitol solution, 1 ul Mn buffer (0.15 M sodium isocitrate and 0.1 M MnCl₂), and 2 μl of diluted Sequenase (3.25 units) were added. The reaction mixture was divided into four aliquots of 3 μl each and mixed with termination mixes (each contains of 4 μl of the appropriate termination mix: 16 μM dATP, 16 μM dCTP, 16 μM dGTP, 16 μM dTTP and 1.6 μM of one of the four ddNTPs, in 50 mM NaCl). The reaction mixtures were incubated at room temperature for 5 min, and 37° C. for 5 min. After the completion of extension, the beads were precipitated and the supernatant was removed. The beads were resuspended in 20 μl TE and kept at 4° C. An aliquot of 2 μl (out of 20 μl) from each tube was taken and mixed with 8 1 μl of formamide, the resulting samples were denatured at 90-95° C. for 5 min and 2 μl (out of 10 μl total) was applied to an ALF DNA sequencer (Pharmacia, Piscataway, N.J.) using a 10% polyacrylamide gel containing 7 M urea and 0.6×TBE. The remaining aliquot was used for MALDI-TOF MS analysis.

[0450] MALDI sample preparation and instrumentation

[0451] Before MALDI analysis, the sequencing ladder loaded magnetic beads were washed twice using 50 mM ammonium citrate and resuspended in 0.5 μl pure water. The suspension was then loaded onto the sample target of the mass spectrometer and 0.5 μl of saturated matrix solution (3-hydroxypicolinic acid (HPA): ammonium citrate=10:1 mole ratio in 50% acetonitrile) was added. The mixture was allowed to dry prior to mass spectrometer analysis.

[0452] The reflectron TOFMS mass spectrometer (Vision 2000, Finnigan MAT, Bremen, Germany) was used for analysis. 5 kV was applied in the ion source and 20 kV was applied for postacceleration. All spectra were taken in the positive ion mode and a nitrogen laser was used. Normally, each spectrum was averaged for more than 1 00 shots and a standard 25-point smoothing was applied.

[0453] RESULTS AND DISCUSSION

[0454] Conventional solid-phase sequencing

[0455] In conventional sequencing methods, a primer is directly annealed to the template and then extended and terminated in a Sanger dideoxy sequencing. Normally, a biotinylated primer is used and the sequencing ladders are captured by streptavidin-coated magnetic beads. After washing, the products are eluted from the beads using EDTA and formamide. Previous findings indicated that only the annealed strand of a duplex is desorbed and the immobilized strand remains on the beads. Therefore, it is advantageous to immobilize the template and anneal the primer. After the sequencing reaction and washing, the beads with the immobilized template and annealed sequencing ladder can be loaded directly onto the mass spectrometer target and mix with matrix. In MALDI, only the annealed sequencing ladder will be desorbed and ionized, and the immobilized template will remain on the target.

[0456] A 39-mer template (SEQ ID No. 23) was first biotinylated at the 3′-end by adding biotin-14-dATP with terminal transferase. More than one biotin-14-dATP molecule could be added by the enzyme. Since the template was immobilized and remained on the beads during MALDI, the number of biotin-14-dATP would not affect the mass spectra. A 14-mer primer (SEQ ID No. 24) was used for the solid-state sequencing to generate DNA fragments 3-27 below (SEQ ID Nos. 142-166). MALDI-TOF mass spectra of the four sequencing ladders are shown in FIG. 44 and the expected theoretical values are shown in Table III. TABLE IIi 1 5′-TCTGGCCTGGTGCAGGGCCTATTGTAGTTGTGACGTACA-(A^(B))_(n)-3′ 2                             3′-TCAACACTGCATGT-5- 3                            3′-ATCAACACTGCATGT-5′ 4                           3′-CATCAACACTGCATGT-5′ 5                          3′-ACATCAACACTGCATGT-5′ 6                         3′-AACATCAACACTGCATGT-5′ 7                        3′-TAACATCAACACTGCATGT-5′ 8                       3′-ATAACATCAACACTGCATGT-5′ 9                      3′-GATAACATCAACACTGCATGT-5′ 10                     3′-GGATAACATCAACACTGCATGT-5′ 11                    3′-CGGATAACATCAACACTGCATGT-5′ 12                   3′-CCGGATAACATCAACACTGCATGT-5′ 13                  3′-CCCGGATAACATCAACACTGCATGT-5′ 14                 3′-TCCCGGATAACATCAACACTGCATGT-5′ 15                3′-GTCCCGGATAACATCAACACTGCATGT-5′ 16               3′-CGTCCCGGATAACATCAACACTGCATGT-5′ 17              3′-ACGTCCCGGATAACATCAACACTGCATGT-5′ 18             3′-CACGTCCCGGATAACATCAACACTGCATGT-5′ 19            3′-CCACGTCCCGGATAACATCAACACTGCATGT-5′ 20           3′-ACCACGTCCCGGATAACATCAACACTGCATGT-5′ 21          3′-GACCACGTCCCGGATAACATCAACACTGCATGT-5′ 22         3′-GGACCACGTCCCGGATAACATCAACACTGCATGT-5′ 23        3′-CGGACCACGTCCCGGATAACATCAACACTGCATGT-5′ 24       3′-CCGGACCACGTCCCGGATAACATCAACACTGCATGT-5′ 25      3′-ACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5′ 26     3′-GACCGGACCACGTCCGGGATAACATCAACAGTGCATGT-5′ 27 3′-AGACCGGACCACGTCCCGGATAACATCAACACTGCATGT-5′ A-reaction C-reaction G-reaction T-reaction 1. 2. 4223.8 4223.8 4223.8 4223.8 3. 4521.1 4. 4809.2 5. 5133.4 6. 5434.6 7. 5737.8 8. 6051.1 9. 6379.2 10. 6704.4 11. 6995.6 12. 7284.8 13. 7574.0 14. 7878.2 15. 8207.4 16. 8495.6 17.8808.8 18. 9097.0 19. 9386.2 20.9699.4 21. 10027.6 22. 10355.8 23. 10644.0 24. 10933.2 25. 11246.4 26. 11574.6 27. 11886.8

[0457] The sequencing reaction produced a relatively homogenous ladder, and the full-length sequence was determined easily. One peak around 5150 appeared in all reactions are not identified. A possible explanation is that a small portion of the template formed some kind of secondary structure, such as a loop, which hindered sequenase extension. Mis-incorporation is of minor importance, since the intensity of these peaks were much lower than that of the sequencing ladders. Although 7-deaza purines were used in the sequencing reaction, which could stabilize the N-glycosidic bond and prevent depurination, minor base losses were still observed since the primer was not substituted by 7-deazapurines. The full length ladder, with a ddA at the 3′ end, appeared in the A reaction with an apparent mass of 11899.8. A more intense peak of 12333 appeared in all four reactions and is likely due to an addition of an extra nucleotide by the Sequenase enzyme.

[0458] The same technique could be used to sequence longer DNA fragments. A 78-mer template containing a CTG repeat (SEQ ID No. 25) was 3′-biotinylated by adding biotin-14-dATP with terminal transferase. An 18-mer primer (SEQ ID No. 26) was annealed right outside the CTG repeat so that the repeat could be sequenced immediately after primer extension. The four reactions were washed and analyzed by MALDI-TOFMS as usual. An example of the G-reaction is shown in FIG. 45 (SEQ ID Nos. 167-220) and the expected sequencing ladder is shown in Table IV with theoretical mass values for each ladder component. All sequencing peaks were well resolved except the last component (theoretical value 20577.4) was indistinguishable from the background. Two neighboring sequencing peaks (a 62-mer and a 63-mer) were also separated indicating that such sequencing analysis could be applicable to longer templates. Again, an addition of an extra nucleotide by the Sequenase enzyme was observed in this spectrum. This addition is not template specific and appeared in all four reactions which makes it easy to be identified. Compared to the primer peak, the sequencing peaks were at much lower intensity in the long template case. TABLE Iv AAGATCTGACCAGGGATTCGGTTAGCGTGACTGCTGCTGGTGCTGCTGGATGATCCGACGCATCAGATCTGG-(A^(B))_(n)-3′ 1                                                       3′-GTACTAGGCTGCGTAGTG-5′ 2                                                      3′-CCTACTAGGCTGCGTAGTC-5′ 3                                                     3′-ACCTACTAGGCTGCGTAGTC-5′ 4                                                    3′-GACCTACTAGGCTGCGTAGTC-5′ 5                                                   3′-CGAGCTACTAGGGTGCGTAGTC-5′ 6                                                  3′-AGGACCTACTAGGCTGCGTAGTC-5′ 7                                                 3′-GACGACCTACTAGGCTGCGTAGTC-5′ 8                                                3′-CGACGACCTAGTAGGCTGCGTAGTC-5′ 9                                               3′-ACGACGACCTACTAGGCTGGGTAGTG-5′ 10                                              3′-GACGACGAGCTACTAGGCTGCGTAGTC-5′ 11                                             3′-CGACGACGACCTACTAGGCTGCGTAGTC-5′ 12                                            3′-ACGACGACGACCTACTAGGCTGCGTAGTC-5′ 13                                           3′-GACGACGACGACCTACTAGGCTGCGTAGTC-5′ 14                                          3′-CGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 15                                         3′-ACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 16                                        3′-GACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 17                                       3′-CGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 18                                      3′-AGGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 19                                     3′-GACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 20                                    3′-GGAGGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 21                                   3′-ACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 22                                  3′-GACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 23                                 3′-CGACGACGACGACGACGAGGACCTACTAGGCTGCGTAGTC-5′ 24                                3′-ACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 25                               3′-GACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 26                              3′-TGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 27                             3′-CTGACGAGGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 28                            3′-ACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 29                           3′-GACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 30                          3′-GCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 31                         3′-CGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 32                        3′-TCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 33                       3′-ATCGCACTGAGGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 34                      3′-AATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 35                     3′-CAATCGGACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 36                    3′-CCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 37                   3′-GCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 38                  3′-AGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 39                 3′-AAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 40                3′-TAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 41               3′-CTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 42              3′-CCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 43             3′-CCCTAAGCCAATGGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 44            3′-TCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 45           3′-GTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 46          3′-GGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 47         3′-TGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 48        3′-CTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 49       3′-ACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 50      3′-GACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTG-5′ 51     3′-AGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ 52    3′-TAGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGGGTAGTC-5′ 53   3′-CTAGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGAGGACGACCTACTAGGCTGCGTAGTC-5′ 54  3′-TCTAGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTG-5′ 55 3′-TTCTAGACTGGTCCCTAAGCCAATCGCACTGACGACGACGACGACGACGACGACCTACTAGGCTGCGTAGTC-5′ ddATP ddCTP ddGTP ddTTP 1. 5491.6 5491.6 5491.6 5491.6 2. 5764.8 3. 6078.0 4. 6407.2 5. 6696.4 6. 7009.6 7. 7338.8 8. 7628.0 9. 7941.2 10. 8270.4 11. 8559.6 12. 8872.8 13. 9202.0 14. 9491.2 15. 9804.4 16. 10133.6 17. 10422.88 18. 10736.0 19. 11065.2 20. 11354.4 21. 11667.6 22. 11996.8 23. 12286.0 24. 12599.2 25. 12928.4 26. 13232.6 27. 13521.8 28. 13835.0 29. 14124.2 30. 14453.4 31. 14742.6 32. 15046.8 33. 15360.0 34. 15673.2 35. 15962.4 36. 16251.6 37. 16580.8 38. 16894.0 39. 17207.2 40. 17511.4 41. 17800.6 42. 18189.8 43. 18379.0 44. 18683.2 45. 19012.4 46. 19341.6 47. 19645.8 48. 19935.0 49. 20248.2 50. 20577.4 51. 20890.6 52. 21194.4 53. 21484.0 54. 21788.2 55. 22092.4

[0459] Sequencing using duplex DNA probes for capturing and priming

[0460] Duplex DNA probes with single-stranded overhang have been demonstrated to be able to capture specific DNA templates and also serve as primers for solid-state sequencing. The scheme is shown in FIG. 46. Stacking interactions between a duplex probe and a single-stranded template allow only a 5-base overhang to be sufficient for capturing. Based on this format, a 5′ fluorescent-labeled 23-mer (5′-GAT GAT CCG ACG CAT CAC AGC TC-3′) (SEQ ID No. 29) was annealed to a 3′-biotinylated 18-mer (5′-GTG ATG CGT CGG ATC ATC-3′) (SEQ ID No. 30), leaving a 5-base overhang. A 15-mer template (5′-TCG GTT CCA AGA GCT-3′) (SEQ ID No. 31) was captured by the duplex and sequencing reactions were performed by extension of the 5-base overhang. MALDI-TOF mass spectra of the reactions are shown in FIG. 47A-D. All sequencing peaks were resolved although at relatively low intensities. The last peak in each reaction is due to unspecific addition of one nucleotide to the full length extension product by the Sequenase enzyme. For comparison, the same products were run on a conventional DNA sequencer and a stacking fluorogram of the results is shown in FIG. 48. As can be seen from the Figure, the mass spectra had the same pattern as the fluorogram with sequencing peaks at much lower intensity compared to the 23-mer primer.

EXAMPLE 10

[0461] Thermo Sequenase Cycle Sequencing

[0462] MATERIALS AND METHODS

[0463] PCR amplification. Human leukocytic genomic DNA was used for PCR amplification. PCR primers to amplify a 209 bp fragment of the β-globin gene were the β2 forward primer (5′-CAT TTG CTT CTG ACA CAA CTG-3′ SEQ ID NO. 32) and the β11 reverse primer (5′-CTT CTC TGT CTC CAC ATG C-3′ SEQ ID NO. 33). Taq polymerase and 10×buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg, Germany). The total reaction volume was 50 μl including 8 pmol of each primer with approximately 200 ng of genomic DNA used as template and a final dNTP concentration of 200 μM. PCR conditions were: 5 min at 94° C., followed by 40 cycles of 30 sec at 94° C., 45 sec at 53° C., 30 sec at 72° C., and a final extension time of 2 min at 72° C. The generated amplified product was purified and concentrated (2×) with the Qiagen ‘Qiaquick’ PCR purification kit (#28106) and stored in H₂O.

[0464] Cycle Sequencing. Sequencing ladders were generated by primer extension with Thermo Sequenase™-DNA Polymerase (Amersham LIFE Science, #E79000Y) under the following conditions: 7 pmol of HPLC purified primer (Cod5 12mer: 5′-TGC ACC TGA CTC-3′ SEQ ID No. 34) were added to 6 μl purified and concentrated amplified product (i.e. 12 μl of the original amplified product), 2.5 units Thermo Sequenase and 2.5 ml Thermo Sequenase reaction buffer in a total volume of 25 μl. The final nucleotide concentrations were 30 μM of the appropriate ddNTP (ddATP, ddCTP, ddGTP or ddTTP; Pharmacia Biotech, #27-2045-01) and 210 μM of each dNTP (7-deaza-dATP, DCTP, 7-deaza-GTP, dTTP; Pharmacia Biotech).

[0465] Cycling conditions were: denaturation for 4 min at 94° C., followed by 35 cycles of 30 sec at 94° C, 30 sec at 38° C., 30 sec at 55° C., and a final extension of 2 min at 72° C.

[0466] Sample preparation and analysis by MALDI-TOF MS. After completion of the cycling program, the reaction volume was increased to 50 μl by addition of 25 μl H₂O. Desalting was achieved by shaking 30 μl of ammonium saturated DOWEX (Fluka #44485) cation exchange beads with 50 μl of the analyte for 2 min at room temperature. The Dowex beads, purchased in the protonated form, were pre-treated with 2M NH₄OH to convert them to the ammonium form, then washed with H₂O until the supernatant was neutral, and finally put in 10 mM ammonium citrate for usage. After the cation exchange, DNA was purified and concentrated by ethanol precipitation by adding 5 μl 3 M ammonium acetate (pH 6.5), 0.5 μl glycogen (10 mg/ml, Sigma), and 110 μl absolute ethanol to the analyte and incubated at room temperature for 1 hour. After 1 2 min centrifugation at 20,000×g the pellet was washed in 70% ethanol and resuspended in 1 μl 18 Mohm/cm H₂O water.

[0467] For MALDI-TOF MS analysis 0.35 μl of resuspended DNA was mixed with 0.35-1.3 μl matrix solution (0.7 M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H₂O:CH₃CN) on a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition using a Thermo Bioanalysis Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. External calibration generated from eight peaks (3000-18000 Da) was used for all spectra.

[0468] RESULTS

[0469]FIG. 49 shows a MALDI-TOF mass spectrum of the sequencing ladder generated from a biological amplified product as template and a 12mer (5′-TGC ACC TGA CTC-3′(SEQ ID NO.34)) sequencing primer. The peaks resulting from depurinations and peaks which are not related to the sequence are marked by an asterisk. MALDI-TOF MS measurements were taken on a reflectron TOF MS. A.) Sequencing ladder stopped with ddATP; B.) Sequencing ladder stopped with ddCTP; C.) Sequencing ladder stopped with ddGTP; D.) Sequencing ladder stopped with ddTTP.

[0470]FIG. 50 shows a schematic representation of the sequencing ladder generated in FIG. 49 with the corresponding calculated molecular masses up to 40 bases after the primer (SEQ ID Nos 221-260). For the calculation the following masses were used: 3581.4 Da for the primer, 312.2 Da for 7-deaza-dATP, 304.2 Da for dTTP, 289.2 Da for dCTP and 328.2 Da for 7-deaza-dGTP.

[0471]FIG. 51 shows the sequence of the amplified 209 bp amplified product within the β-globin gene (SEQ ID No. 261), which was used as a template for sequencing. The sequences of the appropriate PCR primer and the location of the 1 2mer sequencing primer is also shown. This sequence represents a homozygote mutant at the position 4 after the primer. In a wildtype sequence this T would be replaced by an A.

EXAMPLE 11

[0472] Microsatellite Analysis Using Primer Oligo Base Extension (PROBE) and MALDI-TOF Mass Spectrometry

[0473] SUMMARY

[0474] The method uses a single detection primer followed by an oligonucleotide extension step to give products differing in length by a number of bases specific for the number of repeat units or for second site mutations within the repeated region, which can be easily resolved by MALDI-TOF mass spectrometry. The method is demonstrated using as a model system the AluVpA polymorphism in intron 5 of the interferon-a receptor gene located on human chromosome 21, and the poly T tract of the splice acceptor site of intron 8 from the CFTR gene located on human chromosome 7.

[0475] MATERIALS AND METHODS

[0476] Genomic DNA was obtained from 18 unrelated individuals and one family including of a mother, father, and three children. The repeated region was evaluated conventionally by denaturing gel electrophoresis and results obtained were confirmed by standard Sanger sequencing.

[0477] The primers for PCR amplification (8 pmol each) were IFNAR-IVS5-5′: (5′-TGC TTA CTT AAC CCA GTG TG-3′SEQ ID. NO.35) and IFNAR-IVS5-3′.2: (5′-CAC ACT ATG TAA TAC TAT GC-3′ SEQ ID. NO.36) for a part of the intron 5 of the interferon-α receptor gene, and CFEx9-F:(5′-GAA AAT ATC TGA CAA ACT CAT C-3′ SEQ ID. NO.37) (5′-biotinylated) and CFEx9-R:(5′-CAT GGA CAC CAA ATT AAG TTC-3′SEQ ID. NO.38) for CFTR exon 9 with flanking intron sequences of the CFTR gene. Taq-polymerase including lOx buffer were purchased from Boehringer-Mannheim and dNTPs were obtained from Pharmacia. The total reaction volume was 50 μl. PCR conditions were 5 min at 94° C. followed by 40 cycles of: 1 min at 94° C., 45 sec at 53° C., and 30 sec at 72° C., and a final extension time of 5 min at 72° C.

[0478] Amplification products were purified using Qiagen's PCR purification kit (No.28106) according to manufacturer's instructions. Purified products were eluted from the column in 50 μl TE-buffer (10 mM Tris-HCl, 1 mM EDTA, pH 7,5).

[0479] A) Primer oligo base extension reaction (thermo cycling method

[0480] CyclePROBE was performed with 5 pmol appropriate detection primer (IFN:5′-TGA GAC TCT GTC TC-3′SEQ ID. NO.39) in a total volume of 25 μl including I pmol purified template, 2 units Thermosequenase (Amersham Life Science, Cat. #E79000Y) 2.5 μl Thermosequenase buffer, 25 μmol of each deoxynucleotide (7-deaza-dATP, dTTP, and in some experiments extra dCTP) and 100 μmol of dideoxyguanine and in some experiments additional ddCTP. Cycling conditions: initial denaturation 94° C. for 5 min followed by 30 cycles with 44° C. annealing temperature for 30 sec and 55° C. extension temperature for 1 min.

[0481] Primer oligo base extension reaction (isothermal method)

[0482] 10 μl aliquots of the purified double-stranded amplified product (˜3 pmol) were transferred to a streptavidin-coated microliter plate well (˜16 pmol capacity per 50 μl volume; No. 1645684 Boehringer-Mannheim), followed by addition of 10 μl incubation buffer (80 mM sodium phosphate, 400 mM NaCl, 0.4% Tween 20, pH 7.5) and 30 μl water. After incubation for 1 hour at room temperature, the wells were washed three times with 200 μl washing buffer A (40 mM Tris, 1 mM EDTA, 50 mM NaCl, 0.1% Tween 20, pH 8.8) and incubated with 100 μl of 50 mM NaOH for 3 min to denature the double-stranded DNA. Finally, the wells were washed three times with 200 μl 70 mM ammonium citrate solution.

[0483] The annealing of 100 pmol detection primer (CFPT: 5′-TTC CCC AAA TCC CTG-3′ SEQ ID NO. 40) was performed in 50 μl annealing buffer (50 mM ammonium phosphate buffer, pH 7.0 and 100 mM ammonium chloride) at 65° C. for 2 min, at 37° C. for 10 min, and at room temperature for 10 min. The wells were washed three times with 200 μl washing buffer B (40 mM Tris, 1 mM EDTA, 50 mM NH₄Cl, 0.1% Tween 20, pH 8.8) and once in 200 μl TE buffer. The extension reaction was performed using some components of the DNA sequencing kit from USB (No. 70770) and dNTPs or ddNTPs from Pharmacia. Total reaction volume was 45 μl, containing of 21 μl water, 6 μl Sequenase-buffer, 3 μl 100 mM DTT solution, 50 pmol of 7-deaza-dATP, 20 μmol ddCTP, 5.5 μl, glycerol enzyme dilution buffer, 0.25 μl Sequenase 2.0, and 0.25 μl pyrophosphatase. The reaction was pipetted on ice and incubated for 15 min at room temperature and for 5 min at 37° C. Finally,the wells were washed three times with 200 μl washing buffer B.

[0484] The extended primer was denatured from the template strand by heating at 80° C. for 10 min in 50 μl of a 50 mM ammonium hydroxide solution.

[0485] For precipitation, 10 μl 3 M NH4-acetate (pH 6.5), 0.5 ul glycogen (10 mg/ml water, Sigma, Cat.#G1765), and 110 μl absolute ethanol were added to the supernatant and incubated for 1 hour at room temperature. After centrifugation at 13,000 g for 10 min the pellet was washed in 70% ethanol and resuspended in 1 μl 18 Mohm/cm H₂O water.

[0486] Sample preparation was performed by mixing 0.6 μl of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic ammonium citrate in 1:1 H₂O:CH₃CN) with 0.3 μl, of resuspended DNA/glycogen pellet on a sample target and allowed to air dry. Up to 20 samples were spotted on a probe target disk for introduction into the source region of a Thermo Bioanalysis (formerly Finnigan) Visions 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular mass (M_(r)(calc)) were calculated from atomic compositions; reported experimental M_(r) (M_(r)(exp)) values are those of the singly-protonated form, determined using external calibration.

[0487] RESULTS

[0488] The aim of the experiments was to develop a fast and reliable method for the exact determination of the number of repeat units in microsatellites or the length of a mononucleotide stretch including the potential to detect second site mutations within the polymorphic region. Therefore, a special kind of DNA sequencing (primer oligo base extension, PROBE) was combined with the evaluation of the resulting products by matrix-assisted laser desorption ionization (MALDI) mass spectrometry (MS). The time-of-flight (TOF) reflectron arrangement was chosen-as a possible mass measurement system. As an initial feasibility study, an examination was performed first on an AluVpA repeat polymorphism located in intron 5 of the human interferon-α receptor gene (cyclePROBE reaction) and second on the poly T tract located in intron 8 of the human CFTR gene (isothermal PROBE reaction).

[0489] A schematic presentation of the cyclePROBE experiment for the AluVpA repeat polymorphism is given in FIG. 52. The extension of the antisense strand (SEQ ID No. 262) was performed with the sense strand serving as the template. The detection primer is underlined. In a family study co-dominant segregation of the various alleles could be demonstrated by the electrophoretic procedure as well as by the cyclePROBE method followed by mass spec analysis (FIG. 53). Those alleles of the mother and child 2, for which direct electrophoresis of the amplified product indicated one of the two copies to have 13 repeat units,were measured using cyclePROBE to have instead only 11 units using ddG as terminator. The replacement of ddG by ddC resulted in a further unexpected short allele with a molecular mass of approximately 11650 in the DNA of the mother and child 2 (FIG. 54). Sequence analysis verified this presence of two second site mutations in the allele with 13 repeat units. The first is a C to T transition in the third repeat unit and the second mutation is a T to G transversion in the ninth repeat unit. Examination of 28 unrelated individuals shows that the 13 unit allele is spliced into a normal allele and a truncated allele using cyclePROBE. Statistical evaluation shows that the polymorphism is in Hardy-Weinberg equilibrium for both methods, however, using cyclePROBE as detection method the polymorphism information content is increased to 0.734.

[0490] PROBE was also used as an isothermic method for the detection of the three common alleles at the intron 8 splice acceptor site of the CFTR gene (SEQ ID No. 263). FIG. 55 shows a schematic presentation of the expected diagnostic products (SEQ ID Nos. 264-266) with the theoretical mass values. The reaction was also performed in the antisense direction.

[0491]FIG. 56 demonstrates that all three common alleles (T5, T7, and T9, respectively) at this locus could be reliably disclosed by this method. Reference to FIG. 56 indicates that mass accuracy and precision with the reflectron time of flight used in this study ranged from 0-0.4%, with a relative standard deviation of 0.13%. This corresponds to far better than single base accuracy for the up to <90-mer diagnostic products generated in the IFNAR system. Such high analytical sensitivity is sufficient to detect single or multiple insertion/deletion mutations within the repeat unit or its flanking regions, which would induce >1% mass shifts in a 90-mer. This is analogous to the FIG. 56 polyT tract analysis. Other mutations (i.e. an A to T or a T to A mutation within the IFNAR gene A3T repeat) which do not cause premature product termination are not detectable using any dNTP/ddNTP combination with PROBE and low performance MS instrumentation; a 9 Da shift in a 90-mer corresponds to a 0.03% mass shift. Achieving the accuracy and precision required to detect such minor mass shifts has been demonstrated with higher performance instrumentation such as Fourier transform (FT)MS, for which single Da accuracy is obtained up to 100-mers. Further, tandem FTMS, in which a mass shifted fragment can be isolated within the instrument and dissociated to generate sequence specific fragments, has been demonstrated to locate point mutations to the base in comparably sized products. Thus the combination of PROBE with higher performance instrumentation will have an analytical sensitivity which can be matched only by cumbersome full sequencing of the repeat region.

EXAMPLE 12

[0492] Improved Apolipoprotein E Genotyping Using Primer Oligo Base Extension (PROBE) and MALDI-TOF Mass Spectrometry

[0493] MATERIALS AND METHODS

[0494] PCR amplification.

[0495] Human leukocytic genomic DNA from 100 anonymous individuals from a previously published study (Braun, A etal., (1992) Human Genet. 89:401-406) were screened for apolipoprotein E genotypes using conventional methods. PCR primers to amplify a portion of exon 4 of the apo E gene were delineated according to the published sequence (Das, HK et al., (1985) J. Biol. Chem. 260:6240-6247) (forward primer, apoE-F: 5′-GGC ACG GCT GTC CAA GGA G-3′SEQ ID. NO.41; reverse, apoE-R: 5′-AGG CCG CGC TCG GCG CCC TC-3′SEQ ID. NO.42). Taq polymerase and 10×buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg, Germany). The total reaction volume was 50 μL including 8 pmol of each primer and 10% DMSO (dimethylsulfoxide, Sigma) with approximately 200 ng of genomic DNA used as template. Solutions were heated to 80° C. before the addition of 1U polymerase; PCR conditions were: 2 min at 94° C., followed by 40 cycles of 30 sec at 94° C., 45 sec at 63° C., 30 sec at 72° C., and a final extension time of 2 min at 72° C.

[0496] Restriction enzyme digestion and polyacrylamide electrophoresis.

[0497] Cfol and Rsal and reaction buffer L were purchased from Boehringer-Mannheim, and Hhal from Pharmacia (Freiburg, Germany). For Cfol alone and simultaneous Cfol/Rsal digestion, 20 pL of amplified products were diluted with 15 μl water and 4 pL Boehringer-Mannheim buffer L; after addition of 10 units of appropriate restriction enzyme(s) the samples were incubated for 60 min at 37° C. The procedure for simultaneous Hhal/Rsal digestion required first digestion by Rsal in buffer L for one hour followed by addition of NaCl (50 mM end concentration) and Hhal, and additional incubation for one hour. 20 pL of the restriction digest were analyzed on a 12% polyacrylamide gel as described elsewhere (Hixson (1990) J. Lipid Res. 31:545-548). Recognition sequences of Rsal and Cfol (Hhal) are GT/AC and GCG/C, respectively; masses of expected digestion fragments from the 252-mer amplified product with Cfol alone and the simultaneous double digest with Cfol (or Hhal) and Rsal are given in Table V.

[0498] Thermo-PROBE.

[0499] PCR amplification was performed as described above, but with products purified with the Qiagen' Qiaquick' kit to remove unincorporated primers. Multiplex Thermo-PROBE was performed with 35 μl amplified product and 8 pmol each of the codon 112 (5′-GCG GAC ATG GAG GAC GTG-3′ SEQ ID. NO.43) and 158 (5′-GAT GCC GAT GAC CTG CAG AAG-3′ SEQ ID. NO.44) detection primers in 20 μl including ˜1 pmol purified biotinylated antisense template immobilized on streptavidin coated magnetic beads, 2.5 units Thermosequenase, 2 μl Thermosequenase buffer, 50 μM of each dNTP and 200 μM of ddXTP, with the base identity of N and X as described in the text. Cycling conditions were: denaturation (94° C., 30 sec) followed by 30 cycles at 94° C. (10 min) and 60° C. (45 sec).

[0500] Sample preparation and analysis by MALDI-TOF MS.

[0501] For precipitation (Stults et al, (1991) Rapid Commun. Mass Spectrom. 5: 359-363) of both digests and PROBE products, 5 μl 3M ammonium acetate (pH 6.5), 0.5 μl glycogen (10 mg/ml, Sigma), and 110 μl absolute ethanol were added to 50 μl of the analyte solutions and stored for 1 hour at room temperature. After 10 min centrifugation at 13,000×g the pellet was washed in 70% ethanol and resuspended in 1 μl 18 Mohm/cm H₂O water. Where noted in the text, additional desalting was achieved by shaking 10-20 μL of ammonium saturated DOWEX (Fluka #44485) cation exchange beads in 40 μL of analyte. The beads, purchased in the protonated form, were pre-treated with three 5 min spin-decant steps in 2M NH₄OH, followed with H₂O and 10 mM ammonium citrate.

[0502] 0.35 μL of resuspended DNA was mixed with 0.35-1.3 μL matrix solutions (Wu et al. (1993) Rapid Commun. Mass Spectrom. 7:142-146) 0.7 M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H₂O:CH₃CN) on a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition using a Thermo Bioanalysis Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular masses (M_(r)(calc)) of the fragments were calculated from atomic compositions; the mass of a proton (1.08 Da) is subtracted from raw data values in reporting experimental molecular masses (Mr(exp)) as neutral basis. An external calibration generated from eight peaks (3000-18000 Da) was applied to all spectra.

[0503] RESULTS

[0504] Digestion with Cfol alone.

[0505] The inset to FIG. 57a shows a 12% polyacrylamide gel electrophoretic separation of an ε3/ε3 genotype after digestion of the 252 bp apo E amplified product with Cfol. Comparison of the electrophoretic bands with a molecular weight ladder shows the cutting pattern to be as mostly as expected (Table V) for the ε3/ε3 genotype. Differences are that the faint band at approximately 25 bp is not expected, and the smallest fragments are not observed. The accompanying mass spectrum of precipitated digest products shows a similar pattern, albeit at higher resolution. Comparison with Table V shows that the observed masses are consistent with those of single-stranded DNA; the combination of an acidic matrix environment (3-HPA, pK_(a) 3) and the absorption of thermal energy via interactions with the 337 nm absorbing 3-HPA upon ionization is known to denature short stretches of dsDNA under normal MALDI conditions (Tang, K et al., (1994) Rapid Commun Mass Spectrom 8:183-186).

[0506] The approximately 25-mers, unresolved with electrophoresis, are resolved by MS as three single stranded fragments; while the largest (7427 Da) of these may represent a doubly charged ion from the 14.8 kDa fragments (m=14850, z=2; m/z=7425), the 6715 and 7153 Da fragments could result from PCR artifacts or primer impurities; all three peaks are not observed when amplified products are purified with Qiagen purification kits prior to digestion. The Table V 8871 Da 29-mer sense strand 3′-terminal fragment is not observed; the species detected at 9186 Da is consistent with the addition of an extra base (9187-8871=316, consistent with A) by the Taq-polymerase during PCR amplification (Hu, G et al., (1993) DNA and Cell Biol 12:763-770). The individual single strands of each double strand with <35 bases (11 kDa) are resolved as single peaks; the 48-base single strands (M_(r)(calc) 14845 and 14858), however, are observed as an unresolved single peak at 14850 Da. Separating these into single peaks would require a mass resolution (m/

m, the ratio of the mass to the peak width at half height) of 14850/13=1140, nearly an order of magnitude greater than what is routine with the standard reflectron time-of-flight instrumentation used in this study; resolving such small mass differences with high performance instrumentation such as Fourier transform MS, which provides up to three orders of magnitude higher resolution in this mass range, has been demonstrated. The 91-mer single strands (Mr(calc)27849 and 28436) are also not resolved, even though this requires a resolution of only <50. The dramatic decrease in peak quality at higher masses is due to metastable fragmentation (i.e. depurination) resulting from excess internal energy absorbed during and subsequent to laser irradiation.

[0507] Simultaneous digestion with Cfol and Rsal.

[0508]FIG. 57b (inset) shows a 12% polyacrylamide gel electrophoresis separation of ε3/ε3 double digest products, with bands consistent with dsDNA with 24, 31, 36, 48, and 55 base pairs, but not for the smaller fragments. Although more peaks are generated (Table V) than with Cfol alone, the corresponding mass spectrum is more easily interpreted and reproducible since all fragments contain <60 bases, a size range far more appropriate for MALDI-MS if reasonably accurate M_(r) values (e.g., 0.1%) are desired. For fragments in this mass range, the mass measuring accuracy using external calibration is −0.1% (i.e. ≦±10 Da at 10 kDa). Significant depurination (indicated in Figure by asterisk) is observed for all peaks above 10 kDa, but even the largest peak at 17171 Da is clearly resolved from its depurination peak so that an accurate Mr can be measured. Although molar concentrations of digest products should be identical, some discrimination against those fragments with ≦11 bases is observed, probably due to their loss in the ethanol/glycogen precipitation step. The quality of MS results from simultaneous digestion with Cfol (or Hhal) and Rsal is superior to those with Cfol (or Hhal) alone, since the smaller fragments generated are good for higher mass accuracy measurements, and with all genotypes there is no possibility for dimer peaks overlapping with high mass diagnostic peaks. Since digestion by Rsal/Cfol and Rsal/Hhal produce the same restriction fragments but the former may be performed as a simultaneous digest since their buffer requirements are the same, this enzyme mixture was used for all subsequent genotyping by restriction digest protocols. TABLE V Mass and Copy Number of Expected Restriction Digest Products Table Va Cfol Digestion^(a) (+) (−) e2/e2 e2/e2 e2/e2 e2/e2 e2/e2 e2/e2  5781, 5999 — — 1 — 1 2 10752, 10921 — 1 1 2 2 2 14845, 14858 — 1 1 2 2 2 22102, 22440 — — 1 — 1 2 25575, 25763 2 1 1 — — — 27849, 28436 2 2 1 2 1 — Table Vb. Cfol/Rsal Digestion^(b) (+) (−) e2/e2 e2/e3 e2/e4 e3/e3 e3/e4 e4/e4  3428, 4025 — 1 1 2 2 2  5283, 5880 — — 1 — 1 2  5781, 5999 — — 1 — 1 2 11279, 11627 2 2 1 2 1 — 14845, 14858 — 1 1 2 2 2 18269, 18848 2 2 1 — — —

[0509] TABLE VI ddT M_(r) (Calc) ddT M_(r) (Exp) ddC M₅ (Calc) ddC M_(r) (Exp) e2/e2 ^(a)5918, ^(b)6768 — ^(a)6536, ^(b)7387 — e2/e3 ^(a)5918, ^(b)6768, 5919, 6769, ^(a)6536, ^(b)6753, 6542, 6752, ^(b)7965 7967 ^(b)7387 7393 e2/e4 ^(a)5918, ^(b)6768, — ^(a)5903, ^(b)6536, — ^(b)7965, ^(a)8970 ^(b)6753, ^(a)7387 e3/e3 ^(a)5918, ^(b)7965 5918, 7966 ^(a)6536, ^(b)6753 6542, 6756 e3/e4 ^(a)5918, ^(b)7965, 5914, 7959, ^(a)5903, ^(b)6536, 5898, 6533, ^(a)8970 8965 ^(b)6753 6747 e4/e4 ^(b)7965, ^(a)8970 7966, 8969 ^(a)5903, ^(b)6753 5900, 6752

[0510]FIG. 58a-c shows the ApoE ε3/ε3 genotype after digestion with Cfol and a variety of precipitation schemes; equal volume aliquots of the same amplified product were used for each. The sample treated with a single precipitation (FIG. 58a) from an ammonium acetate/ethanol/gly-cogen solution results in a mass spectrum characterized by broad peaks, especially at high mass. The masses for intense peaks at 5.4, 10.7, and 14.9 kDa are 26 Da (0.5%), 61 Da (0.6%), and 45 Da (0.3%) Da higher, respectively, than the expected values; the resolution (the ratio of a peak width at half its total intensity to the measured mass of the peak) for each of these is −50, and decreases with increasing mass. Such observations are consistent with a high level of nonvolatile cation adduction; for the 10.8 kDa fragment, the observed mass shift is consistent with a greater than unit ratio of adducted:nonadducted molecular ions.

[0511] MS peaks from a sample redissolved and precipitated a second time are far sharper (FIG. 58b), with resolution values nearly double those of the corresponding FIG. 58a peaks. Mass accuracy values are also considerably improved; each is within 0.07% of its respective calculated values, close to the independently determined instrumental limits for DNA measurement using 3-HPA as a matrix. Single (not shown) and double (FIG. 58C) precipitations with isopropyl alcohol (IPA) instead of ethanol result in resolution and mass accuracy values comparable to those for corresponding ethanol precipitations, but enhanced levels of dimerization are observed, again potentially confusing measurements when such dimers overlap with higher mass “diagnostics” monomers present in the solution. EtOH/ammonium acetate precipitation with glycogen as a nucleation agent results in nearly quantitative recovery of fragments except for the 7-mers, serving as a simultaneous concentration and desalting step prior to MS detection. Precipitation from the same EtOH/ammonium acetate solutions in the absence of glycogen results in far poorer recovery, especially at low mass.

[0512] The results indicate that to obtain accurate (M_(r)(exp) values after either 1 PA and EtOH precipitations, a second precipitation is necessary to maintain high mass accuracy and resolution.

[0513] The ratio of matrix:digest product also affects spectral quality; severe suppression of higher mass fragments (not shown) observed with 1:1 volume matrix: digest product (redissolved in 1 μL) is alleviated by using a 3-5 fold volume excess of matrix.

[0514] Apo E genotyping by enzymatic digestion. Codon 112 and 158 polymorphisms fall within Cfol (but not Rsal) recognition sequences. In the 252 bp amplified product studied here, invariant (i.e. cut in all genotypes) sites cause cuts after bases 31, 47, 138, 156, 239, and 246. The cutting site after base 66 is only present for ε4, while that after base 204 is present in ε3 and ε4; the ε2 genotype is cut at neither of these sites. These differences in the restriction pattern can be demonstrated as variations in mass spectra. FIG. 59 shows mass spectra from several ApoE genotypes available from a pool of 100 patients (Braun, A et al., (1992) Hum. Genet. 89:401-406). Vertical dashed lines are drawn through those masses corresponding to the expected Table V diagnostic fragments; other labeled fragments are invariant. Referring to Table V, note that a fragment is only considered “invariant” if it is present in duplicate copies for a given allele; to satisfy this requirement, such a fragment must be generated in each of the ε2m ε3, and ε4 alleles.

[0515] The spectrum in FIG. 59a contains all of the expected invariant fragments above 3 kDa, as well as diagnostic peaks at 3428 and 4021 (both weak), 11276 and 11627 (both intense), 14845, 18271, and 18865 Da. The spectrum in FIG. 59b is nearly identical except that the pair of peaks at 18 kDa is not detected, and the relative peak intensities, most notably among the 11-18 kDa fragments, are different. The spectrum in FIG. 59c also has no 18 kDa fragments, but instead has new low intensity peaks between 5-6 kDa. The intensity ratios for fragments above 9 kDa are similar to those of FIG. 59b except for a relatively lower 11 kDa fragment pair. FIG. 59d, which again contains the 5-6 kDa cluster of peaks, is the only spectrum with no 11 kDa fragments, and like the previous two also has no 18 kDa fragment.

[0516] Despite the myriad of peaks in each spectrum, each genotype can be identified by the presence and absence of only a few of the Table Vb diagnostic peaks. Due to the limited resolution of the MALDI-TOF instrumentation employed, the most difficult genotypes to differentiate are those based upon the presence or absence of the four diagnostic fragments between 5.2 and 6.0 kDa characteristic of the ε4 allele, since these fragments nearly overlap with several invariant peaks. It has been found herein that the 5283 Da diagnostic fragment overlaps with a depurination peak from the 5412 Da invariant fragment, and the 5781 Da diagnostic peak is normally not completely resolved from the 5750 Da invariant fragment. Thus, distinguishing between an ε2/ε4 and ε2/ε3, or between an ε3/ε4 and an ε3/ε3 allele, relies upon the presence or absence of the 5880 and 5999 Da fragments. Each of these is present in FIGS. 59c and 59 d, but not in 59 a or 59 b.

[0517] The genotype of each of the patients in FIG. 59 can be more rapidly identified by reference to the flowchart in FIG. 60. Consider the FIG. 59a spectrum. The intense pair of peaks at 11 kDa discounts the possibility of homozygous ε4, but does not differentiate between the other five genotypes. Likewise, the presence of the unresolved 14.8 kDa fragments is inconsistent with homozygous ε2, but leaves four possibilities (ε2/ε3, ε2/ε4, ε3/ε3, ε3/ε4). Of these only ε2/ε3 and ε2/ε4 are consistent with the 1 8 kDa peaks; the lack of peaks at 5283, 5879, 5779, and 5998, Da indicate that the FIG. 59a sample is ε2/ε3. Using the same procedure, the FIGS. 59b-d genotypes can be identified as ε3/ε3, ε3/ε4, and ε4/ε4, respectively. To date, all allele identifications by this method have been consistent with, and in many cases more easily interpreted than, those attained via conventional methods. The assignment can be further confirmed by assuring that fragment intensity ratios are consistent with the copy numbers of Table V. For instance, the 14.8 kDa fragments are of lower intensity than those at 16-17 kDa in FIG. 59a, but the opposite is seen in FIGS. 59b-d. This is as expected, since in the latter three genotypes the 14.8 kDa fragments are present in duplicate, but the first is a heterozygote containing e², so that half of the amplified products do not contribute to the 14.8 kDa signal. Likewise, comparison of the 11 kDa fragment intensify to those at 9.6 and 14.8 kDa indicate that this fragment is double, double, single, and zero copy in FIGS. 59a, d, respectively. These data confirm that MALDI can perform in a semi-quantitative way under these conditions.

[0518] ApoE genotyping by Primer Oligo Base Extension (PROBE). The PROBE reaction was also tested as a means of simultaneous detection of the codon 112 and 158 polymorphisms. A detection primer is annealed to a single-stranded PCR-amplified template so that its 3′ terminus is just downstream of the variable site. Extension of this primer by a DNA polymerase in the presence of three dNTPs and one ddXTP (that is not present as a dNTP) results in products whose length and mass depend upon the identity of the polymorphic base. Unlike standard Sanger type sequencing, in which a particular base-specific tube contains −99% dXTP and −1% ddXTP, the PROBE mixture contains 100% of a particular ddXTP combined with the other three dNTPs. Thus with PROBE a full stop of all detection primers is achieved after the first base complementary to the ddXTP is reached.

[0519] For the ε2/ε3 genotype, the PROBE reaction (mixture of ddTTP, dATP, dCTP, dGTP) causes a Mr(exp) shift of the codon 112 primer to 5919 Da, and of the codon 158 primer to 6769 and 7967 Da (Table VI); a pair of extension products results from the single codon 158 primer because the ε2/ε3 genotype is heterozygous at this position. Three extension products (one from codon 158, two from 112) are also observed from the heterozygote ε3/ε4 (FIG. 61c and Table VI), while only two products (one from each primer) are observed from the FIG. 61b (ε3/ε3) and FIG. 59d (ε4/ε4) homozygote alleles. Referring to Table VI, each of the available alleles result in all expected ddT reaction product masses within 0.1% of the theoretical mass, and thus each is unambiguously characterized by this data alone. Further configuration of the allele identities may be obtained by repeating the reaction with ddCTP (plus dATP, dTTP, dGTP); these results, summarized also in Table VI, unambiguously confirm the ddT results.

[0520] Appropriateness of the methods. Comparison of FIGS. 59 (restriction digestion) and 61 (PROBE) indicates that the PROBE method provides far more easily interpreted spectra for the multiplex analysis of codon 112 and 158 polymorphisms than does the restriction digest analysis. While the digests generate up to −25 peaks per mass spectrum and in some case diagnostic fragments overlapping with invariant fragments, the PROBE reaction generates a maximum of only two peaks per detection primer (i.e. polymorphism). Automated peak detection, spectrum analysis, and allele identification would clearly be far more straightforward for the latter. Spectra for highly multiplexed PROBE, in which several polymorphic sites from the same or different amplified products are measured from one tube, are also potentially simple to analyze. Underscoring its flexibility, PROBE data analysis can be further simplified by judicious a priori choice of primer lengths, which can be designed so that no primers or products can overlap in mass.

[0521] Thus while PROBE is the method of choice for large scale clinical testing of previously well characterized polymorphic sites, the restriction digest analysis as described here is ideally suited to screening for new mutations. The identity of each of the two polymorphisms discussed in this study affects the fragment pattern; if this is the only information used, then the MS detection is a faster alternative to conventional electrophoretic separation of restriction fragment length polymorphism products. The exact measurement of fragment M_(r) values can also give information on about sites completely remote from the enzyme recognition site since other single point mutations necessarily alter the mass of each of the single strands of the double stranded fragment containing the mutation. The 252 bp amplified product could also contain allelic variants resulting in, for example, previously described Glyl27 Asp (Weisgraber, KH etal., (1984) J. Clin. Invest. 73:1024-1033), Argl36Ser (Wardell, MR et al., (1987) J. Clin. Invest. 80:483-490), Argl42Cys (Horie, Y et al., (1992) J. Biol. Chem. 267:1962-1968), Arg145Cys (Rall SC Jr et al., (1982) Proc. Natl. Acad. Sci. U.S.A. 79:4696-4700), Lysl46Glu (Mann, Wash. et al., (1995) J. Clin. Invest. 96:1100-1107), or Lysl46Gln (Smit, M etal., (1990) J. Lipid Res. 31:45-53) substitutions. The G→A base substitution which codes for the Gly127 Asp amino acid substitution would result in a −16 Da shift in the sense strand, and in a +15 Da (C→T) shift in the antisense strand, but not in a change in the restriction pattern. Such a minor change would be virtually invisible by electrophoresis; however, with accurate mass determination the substitution could be detected; the invariant 55-mer fragment at 16240 (sense) and 17175 Da would shift to 16224 and 17190 Da, respectively. Obtaining the mass accuracy required to detect such minor mass shifts using current MALDI-TOF instrumentation, even with internal calibration, is not routine since minor unresolved adducts and/or poorly defined peaks limit the ability for accurate mass calling. With high performance electrospray ionization Fourier transform (ESI-FTMS) single Da accuracy has been achieved with synthetic oligonucleotides (Little, DP et al., (1995) Proc. Natl. Acad. Sci. U.S.A. 92:2318-2322) up to 100-mers (Little, DP etal., (1994) J. Am. Chem. Soc. 11 6:4893-4897), and similar results have recently been achieved with up to 25-mers using MALDI-FTMS (Li, Y et al., (1996) Anal. Chem. 68:2090-2096).

EXAMPLE 13

[0522] A Method for Mass Spectrometric Detection of DNA Fragments Associated With Telomerase Activity

[0523] INTRODUCTION

[0524] One-fourth of all deaths in the United States are due to malignant tumors (R. K. Jain, (1996) Science 271:1079-1080). For diagnostic and therapeutic purposes there is a high interest in reliable and sensitive methods of tumor cell detection.

[0525] Malignant cells can be distinguished from normal cells by different properties. One of those is the immortalization of malignant cells which enables uncontrolled cell-proliferation. Normal diploid mammalian cells undergo a finite number of population doublings in culture, before they undergo senescence. It is supposed that the number of population doublings in culture, before they undergo senescence. It is supposed that the number of population doublings is related to the shortening of chromosome ends, called telomers, in every cell division. The reason for said shortening is based on the properties of the conventional semiconservative replication machinery. DNA polymerases only work in 5′ to 3′ direction and need an RNA primer.

[0526] Immortalization is thought to be associated with the expression of active telomerase. Said telomerase is a ribonucleoprotein catalyzing repetitive elongation of templates. This activity can be detected in a native protein extract of telomerase containing cells by a special PCR-system (N. W. Kim et al. (1994) Science 266:2011-2015) known as telomeric repeat amplification protocol (TRAP). The assay, as used herein, is based on the telomerase specific extension of a substrate primer (TS) and a subsequent amplification of the telomerase specific extension products by a PCR step using a second primer (bioCX) complementary to the repeat structure. The characteristic ladder fragments of those assays are conventionally detected by the use of gel electrophoretic and labeling or staining systems. These methods can be replaced by MALDI-TOF mass spectrometry leading to faster accurate and automated detection.

[0527] MATERIALS AND METHODS

[0528] Preparation of cells

[0529] 1×10⁶ cultured telomerase-positive cells were pelleted, washed once with PBS (137 mM NaCl, 2.7 mM KCl, 4.3 mM Na₂HPO₄·7H₂O, 1.4 mM KH₂PO₄ in sterile DEPC water). The prepared cells may be stored at −75° C. Tissue samples have to be homogenized, according to procedures well known in the art, before extraction.

[0530] Telomerase extraction

[0531] Pellet was resuspended in 200 μl CHAPS lysis buffer (10 mM Tris-HCl pH 7.5, 1 mM MgCl₂, 1 mM EGTA, 0.1 mM benzamidine, 5 mM β-mercaptoethanol, 0.5% CHAPS, 10% glycerol) and incubated on ice for 30 min. The sample was centrifuged at 12,000 g for 30 min at 4° C. The supernatant was transferred into a fresh tube and stored at 75° C. until use.

[0532] TRAP-assay

[0533] 2 μl of telomerase extract were added to a mixture of 10×TRAP buffer (200 mM Tris-HCl pH 8.3, 15 mM MgCl₂, 630 mM KCl, 0.05% Tween 20, 10 mM EGTA) 50×dNTP-mix (2.5 mM each dATP, dTTP, dGTP, and dCTP), 10 pmol of TS primer and 50 pmol of bio CX primer in a final volume of 50 μl. The mixture was incubated at 30° C. for 10 minutes and 5 min. at 94° C., 2 units of Taq Polymerase were added and a PCR was performed with 30 cycles of 94° C. for 30 seconds, 50° C. for 30 seconds and 72° C. for 45 seconds.

[0534] Purification of TRAP-assay products

[0535] For every TRAP-assay to be purified, 50 μl Streptavidin M-280 Dynabeads (10 mg/ml) were washed twice with 1×BW buffer (5 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, 1 M NaCl). 50 μl of 2×BW buffer were added to the PCR mix and the beads were resuspended in this mixture. The beads were incubated under gentle shaking for 15 min. at ambient temperature. The supernatant was removed and the beads were washed twice with 1×BW buffer. To the beads 50 μl 25% ammonium hydroxide were added and incubated at 60° C. for 10 min. The supernatant was saved, the procedure repeated, both supernatants were pooled and 300 μl ethanol (100%) were added. After 30 min. the DNA was pelleted at 13,000 rpm for 12 min., the pellet was air-dried and resuspended in 600 ni ultrapure water.

[0536] MALDI-TOF MS of TRAP-assay products

[0537] 300 nl sample were mixed with 500 nl of saturated matrix-solution (3-HPA:ammonium citrate=10:1 molar ratio in 50% aqueous acetonitrile), dried at ambient temperature and introduced into the mass spectrometer (Vision 2000, Finigan MAT). All spectra were collected in reflector mode using external calibration.

[0538] Sequences and masses

[0539] bioCX: d(bio-CCC TTA CCC TTA CCC TTA CCC TAA SEQ ID NO. 45), mass: 7540 Da.

[0540] TS: d(AAT CCG TGC AGC AGA GTT SEQ ID NO.46), mass: 5523 Da.

[0541] Telomeric-repeat structure: (TTAGGG)_(n), mass of one repeat: 1909.2

[0542] Amplification products:

[0543] TS elongated by three telomeric repeats (first amplification product):

[0544] 12452 Da. (N₃)

[0545] TS elongated by four telomeric repeats: 14361 Da. (N₄)

[0546] TS elongated by seven telomeric repeats: 20088 Da. (N₇)

[0547] RESULTS

[0548]FIG. 62 depicts a section of a TRAP-assay MALDI-TOF mass spectrum. Assigned are the primers TS and bioCX at 5497 and 7537 Da, respectively (calculated 5523 and 7540 Da). The signal marked by an asterisk represents n-1 primer product of chemical DNA synthesis. The first telomerase specific TRAP-assay product is assigned at 1 2775 Da. This product represents a 40-mer containing three telomeric repeats. Due to primer sequences this is the first expected amplification product of a positive TRAP-assay. The product is elongated by an additional nucleotide due to extendase activity of Taq DNA polymerase (calculated non-extended product: 12452 Da, by A extended product: 12765 Da). The signal at 6389 Da represents the doubly charged ion of this product (calculated: 6387 Da). FIG. 63 shows a section of higher masses of the same spectrum as depicted in FIG. 62, therefore the signal at 12775 Da is identical to that in FIG. 62. The TRAP-assay product containing seven telomeric repeats, representing a 64-mer also elongated by an additional nucleotide, is detected at 20322 Da (calculated: 20395 Da). The signals marked 1, 2, 3 and 4 cannot be base-line resolved. This region includes of: 1. signal of dimeric n-1 primer, 2. second TRAP-assay amplification product, containing 4 telemeric repeats and therefore representing a 46-mer (calculated: 14341 Da/14674 Da for extendase elongated product) and 3. dimeric primer-ion and furthermore all their corresponding depurination signals. There is a gap observed between the signals of the second and fifth extension product. This signal gap corresponds to the reduced band intensities observed in some cases for the third and fourth extension product in autoradiographic analysis of TRAP-assays (N. W. Kim et al. (1994) Science 266:2013).

[0549] The above-mentioned problems, caused by the dimeric primer and related signals, can be overcome using an ultrafiltration step employing a molecular weight cut-off membrane for primer removal prior to MALDI-TOF-MS analysis. This will permit an unambiguous assignment of the second amplification product.

EXAMPLE 14

[0550] A method for Detecting Neuroblastoma-Specific Nested RT-amplified products Via MALDI-TOF Mass Spectrometry

[0551] Introduction

[0552] Neuroblastoma is predominantly a tumor of early childhood with 66% of the cases presenting in children younger than 5 years of age. The most common symptoms are those due to tumor mass, bone pain, or those caused by excessive catecholamine secretion. In rare cases, neuroblastoma can be identified prenatally (R. W. Jennings et al, (1993) J. Ped. Surgery 28:1168-1174). Approximately 70% of all patients with neuroblastoma have metastatic disease at diagnosis. The prognosis is dependent on age at diagnosis, clinical stage and other parameters.

[0553] For diagnostic purposes there is a high interest in reliable and sensitive methods of tumor cell detection, e.g., in control of autologous bone marrow transplants or on-going therapy.

[0554] Since catecholamine synthesis is a characteristic property of neuroblastoma cells and bone marrow cells lack this activity (H. Naito et al., (1991) Eur. J. Cancer 27:762-765), neuroblastoma cells or metastasis in bone marrow can be identified by detection of human tyrosine 3-hydroxylase (E.C. 1.14.16.2, hTH) which catalyzes the first step in biosynthesis of catecholamines.

[0555] The expression of hTH can be detected via reverse transcription (RT) polymerase chain reaction (PCR) and the amplified product can be analyzed via MALDI-TOF mass spectrometry.

[0556] Materials and methods

[0557] Cell- or tissue-treatment

[0558] Cultures cells were pelleted (10 min. 8000 rpm) and washed twice with PBS (137 mM NaCl, 2.7 mM KCl, 4.3 mM Na₂HPO₄·7H₂O, 1.4 mM KH₂PO₄ in sterile PEPC water). The pellet was resuspended in 1 ml lysis/binding buffer (100 mM Tris-HCl, pH 8.0, 500 mM LiCl, 10 mM EDTA, 1% Li-dodecyl sulfate, 5 mM DTT) until the solution becomes viscose. Viscosity was reduced by DNA-shear step using a 1 ml syringe. The lysate may be stored in −75° C. or processed further directly. Solid tissues (e.g., patient samples) have to be homogenized before lysis.

[0559] Preparation of magnetic Oligo-dT(25) beads

[0560] 100 μL beads per 1×10⁶ cells were separated from the storage buffer and washed twice with 200 μL lysis/binding buffer.

[0561] Isolation of poly A⁺ RNA

[0562] The cell lysate was added to the prepared beads and incubated for 5 min. at ambient temperature. The beads were separated magnetically for 2-5 min. and washed twice with 0.5 ml LDS (10 mM Tris-HCl, pH 8.0, 0.15 M LiCl, 1 mM EDTA, 0.1% LiDS).

[0563] Solid-phase first-strand cDNA synthesis

[0564] The poly A⁺ RNA containing beads were resuspended in 20 μL of reverse transcription mix (50 mM Tris-HCl, pH 8.3, 8 mM MgCl₂, 30 mM KCl, 10 mM DTT, 1.7 mM dNTPs, 3 U AMV reverse transcriptase) and incubated for 1 hour at 45° C. (with a resuspension step all ten min.). The beads were separated from the reverse transcription mix, resuspended in 50,uL of elution buffer (2 mM EDTA pH 8.0) and heated to 95° C. for 1 min. fur elution of the RNA. The beads with the cDNA first-strand can be stored in TB (0.089 M Tris-base, 0.089 M boric acid, 0.2 mM EDTA pH 8.0), TE 10 mM Tris-HCI, 0.1 mM EDTA, pH 8.0) or 70% ethanol for further processing.

[0565] Nested polymerase chain reaction Beads containing cDNA first-strand were washed twice with 1x PCR buffer (20 mM Tris-HCl pH 8.75, 10 mM KCI, 10 mM (NH₄)₂SO₄, 2 mM MgSO₄, 0.1% Triton X-100, 0.1 mg bovine serum albumin) and resuspended in PCR mix (containing 100) pmol of each outer primer, 2.5 u Pfu (exo-) DNA polymerase, 200 μM of each dNTP and PCR buffer in a final volume of 50 μL). The mixture was incubated at 72° C. 1 min. and amplified by PCR for 30 cycles. for the nested reaction: 1 μL of the first PCR was added as template to a PCR mix d(as above but nested primers instead of outer primers) and subjected to the following temperature program: 94° C. 1 min., 65° C. 1 min. and 72° C. 1 min. for 20 cycles.

[0566] Purification of nested amplified products

[0567] Primers and low-molecular reaction by-products are removed using 10,000 Da cut-off ultrafiltration-unit. Ultrafiltration was performed at 7,500 g for 25 minutes. For every PCR to be purified, 50 μL Streptavidin M-280 Dynabeads (10 mg/ml) were washed twice with 1×BW buffer (5 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, 1 M NaCl), added to the ultrafiltration membrane and incubated under gentle shaking for 15 min. at ambient temperature. The supernatant was removed and the beads were washed twice with 1×BW buffer. 50 μL 25% ammonium hydroxide were added to the beads and incubated at ambient temperature for 10 min. The supernatant was saved, the procedure repeated, both supernatants were pooled and 300 μL ethanol (100%) were added. After 30 min. the DNA was pelleted at 13,000 rpm for 12 min., the pellet was air-dried and resuspended in 600 nl ultrapure water.

[0568] MALDI-TOF MS of nested amplified products

[0569] 300 nl sample was mixed with 500 nl of saturated matrix-solution (3-HPA: ammonium citrate=10:1 molar ratio in 50% aqueous acetonitrile), dried at ambient temperature and introduced into the mass spectrometer (Vision 2000, Finigan MAT). All spectra were collected in reflector mode using external calibration.

[0570] Outer primers: hTH1: d(TGT CAG AGC TGG ACA AGT GT SEQ ID NO:47) hTH2: d(GAT ATT GTC TTC CCG GTA GC SEQ ID NO:48)

[0571] Nested primers: bio-hTH d(bio-CTC GGA CCA GGT GTA CCG CC SEQ ID NO:49), mass: 6485 Da hTH6; d(CCT GTA CTG GAA GGC GAT CTC SEQ ID NO:50), mass:6422 21 Da

[0572] mass of biotinylated single strand amplified product: 19253:6 Da

[0573] mass of nonbiotinylated single strand amplified product: 18758.2 Da

[0574] Results

[0575] A MALDI-TOF mass spectrum of a human tyrosine 3-hydroxylase (hTH) specific nested amplified product (61-mer) is depicted in FIG. 64. The signal at 18763 Da corresponds to non-biotinylated strand of the amplified product (calculated: 18758.2 Da, mass error: 0.02 Da). The signals below 10,000 and above 35,000 Da are due to multiply charged and dimeric amplified product-ions, respectively.

[0576] The product was obtained from a solid phase cDNA derived in a reverse transcription reaction from 1×10⁶ cells of a neuroblastoma cell-line (L-A-N-1) as described above. The cDNA first-strand was subjected to a first PCR using outer primers (hTH1 and hTH2), an aliquot of this PCR was used as template in a second PCR using nested primers (biohTH and hTH6). The nested amplified product was purified and MALDI-TOF MS analyzed:

[0577] The spectrum in FIG. 64 demonstrates the possibility of neuroblastoma cell detection using nested RT-PCR and MALDI-TOF MS analysis.

EXAMPLE 15

[0578] Rapid Detection of the RET Proto-oncogene Codon 634 Mutation Using Mass Spectrometry

[0579] Material and Methods

[0580] Probe

[0581] The identity of codon 634 in each of the three alleles was confirmed by Rsal enzymatic digestion, single strand conformational polymorphism or Sanger sequencing. Exon 11 of the RET gene was PCR amplified (40 cycles) from genomic DNA using Taq-Polymerase (Boehringer-Mannheim) with 8 pmol each of 5′-biotinylated forward (5′-biotin-CAT GAG GCA GAG CAT ACG CA-3′ SEQ ID NO:51) and unmodified reverse (5′-GAC AGC AGC ACC GAG ACG AT-3′ SEQ ID NO:52) primer per tube; amplified products were purified using the Qiagen “QlAquick” kit to remove unincorporated primers. 15 μl of amplified product were immobilized on 10 μL (10 mg/mL) Dynal streptavidin coated magnetic beads, denatured using the manufacturer's protocol, and the supernatant containing antisense strand discarded, the PROBE reaction was performed using thermoSequenase (TS) DNA Polymerase (Amersham) and Pharmacia dNTP/ddNTPs. 8 pmol of extension primer (5′-CGG CTG CGA TCA CCG TGC GG-3′ SEQ ID NO:53) was added to 13 μL H₂O, 2 μL TS-buffer, 2 μL 2 mM ddATP (or ddTTP), and 2 μL of 0.5 mM dGTP/dCTP/dTTP (or dGTP/DCTP/dATP), and the mixture heated for 30 sec @ 94° C., followed by 30 cycles of 10 sec @ 94° C. and 45 sec @ 50° C.; after a 5 min. incubation @ 95° C, the supernatant was decanted, and products were desalted by ethanol precipitation with the addition of 0.5 μL of 10 mg/mL glycogen. The resulting pellet was washed in 70% ethanol, air dried, and suspended in 1 μL H₂O. 300 nL of this was mixed with the MALDI matrix (0.7 M 3-hydroxypicolinic acid, 0.07 M ammonium citrate in 1:1 H₂O:CH₃CN) on a stainless steel sample probe and air dried. Mass, spectra were collected on a Thermo Bionalysis Vision 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Experimental masses (m_(r)(exp)) reported are those of the neutral molecules as measured using external calibration.

[0582] Direct Measurement of Diagnostic Products

[0583] PCR amplifications conditions for a 44 bp region containing codon 634 were the same as above but using Pfu polymerase; the forward primer contained a ribonucleotide at its 3′-terminus (forward, 5′-GAT CCA CTG TGC GAC GAG C (SEQ ID NO:54) -ribo; reverse, 5′-GCG GCT GCG ATC ACC GTG C (SEQ ID NO:55). After product immobilization and washing, 80 μL of 12.5% NH₄OH was added and heated at 80° C. overnight to cleave te primer from 44-mer (sense strand) to give a 25-mer. Supernatant was pipetted off while still hot, dried resuspended in 50 μL H₂O, precipitated, resuspended, and measured by MALDI-TOF as above. MALDI-FTMS spectra of 25-mer synthetic analogs were collected as previously described (Li, Y. et al., (1996) Anal. Chem. 68:2090-2096); briefly, 1-10 pmol DNA was mixed 1:1 with matrix on a direct insertion probe, admitted into the external ion source (positive ion mode), ionized upon irradiance with a 337 nm wavelength laser pulse, and transferred via rf-only quadruple rods into a 6.5 Tesla magnetic field where they were trapped collisionally. After a 15 second delay, ions were excited by a broadband chirp pulse and detected using 256K data points, resulting in time domain signals of 5 s duration. Reported (neutral) masses are those of the most abundant isotope peak after subtracting the mass of the charge carrying proton (1.01 Da).

[0584] Results

[0585] The first scheme presented utilizes the PROBE reaction shown schematically in FIG. 65. A 20-mer primer is designed to bind specifically to a region on the complementary template downstream of the mutation site; upon annealing to the template, which is labelled with biotin and immobilized to streptavidin coated magnetic beads, the PROBE primer is presented with a mixture of the three deoxynucleotide triphosphates (dNTPs), a di-dNTP (ddNTP), and a DNA polymerase (FIG. 65). The primer is extended by a series of bases specific to the identity of the variable base in codon 634; for any reaction mixture (e.g., ddA+dT+dC+dG), three possible extension products representing the three alleles are possible (FIG. 65).

[0586] For the negative control (FIG. 66), the PROBE reaction with ddATP+dNTPs (N=T, C, G) causes a Mr(exp) shift of the primer from 6135 to 6726 Da (Δm+591). The absence of a peak at 6432 rules out a C→A mutation (FIG. 65); the mass of the single observed peak is more consistent with extension by C-ddA (Mr(calc) 6721, +0.07% error) than by T-ddA (Mr(calc) 6736, −0.15% error) than of A₃TC₂G expected for C→A mutant. Combining the ddA and ddT reaction data, it is clear that the negative control is as expected homozygous normal at codon 634.

[0587] The ddA reaction for patient 1 also results in a single peak (M_(r)(exp)=6731) between expected values for wildtype and C→T mutation (FIG. 65b). The ddT reaction, however, results in two clearly resolved peaks consistent with a heterozygote wildtype (M_(r)(exp) 8249, +0.04% mass error)/C→T mutant (M_(r)(exp) 6428 Da, +0.08% mass error). For patient 2, the pair of FIG. 66c ddA products represent a heterozygote C→A (M_(r)(exp) 6431, -0.06% mass error)/normal (Mr(exp) 6719, −0.03% mass error) allele. The ddT reaction confirms this, with a single peak measured at 8264 Da consistent with unresolved wildtype and C→A alleles. The value of duplicate experiments is seen by comparing FIGS. 66a and 66 b; while for patient 1 the peak at 6726 from the ddA reaction represents only one species, similar peak from patient 1 is actually a pair of unresolved peaks differing in mass by 15 Da.

[0588] An alternate scheme for point mutation detection is differentiation of alleles by direct measurement of diagnostic product masses. A 44-mer containing the RET634 site was generated by the PCR, and the 19-mer sense primer removed by NH₄OH cleavage at a ribonucleotide at its 3′ terminus.

[0589]FIG. 67 shows a series of MALDI-FTMS spectra of synthetic analogs of short amplified products containing the RET634 mutant site. FIGS. 67a-c and 67 d-f are homozygous and heterozygous genotypes, respectively. An internal calibration was done using the most abundant isotope peak for the wildtype allele; application of this (external) calibration to the five other spectra resulted in better than 20 ppm mass accuracy for each. Differentiation by mass alone of the alleles is straightforward, even for heterozygote mixtures whose components differ by 16.00 (FIG. 67d), 2501 (FIG. 67e), or 9.01 Da (FIG. 65f). The value of high performance MS is clear when recognition of small DNA mass shifts is the basis for diagnosis of the presence or absence of a mutation. The recent reintroduction of delayed extraction (DE) techniques has improved the performance of MALDI-TOF with shorts DNAs (Roskey, M. T. et al., (1996) Anal. Chem. 68:941-946); a resolving power (RP) of >10³ has been reported for a mixed-base 50-mer, and a pair of 31-mere with a C or a T (Δm 15 Da) at a variable position resolved nearly to baseline. Thus DE-TOF-MS has demonstrated the RP required for separation of the individual components of heterozygotes. Even with DE, however, the precision of DNA mass measurement with TOF is typically 0.1% (8 Da at 8 kDa) using external calibration, sufficiently high to result in incorrect diagnoses. Despite the possibility of space charge induced frequency shifts (Marshall, A. G. etaL. (1991) Anal. Chem. 63:215A-229A), MALDI-FTMS mass errors are rarely as high as 0.005% (0.4 Da at 8 KDa), making internal calibration unnecessary.

[0590] The methods for DNA point mutation presented here are not only applicable to the analysis of single base mutations, but also to less demanding detection of single or multiple base insertions or deletions, and quantification of tandem two, three, or four base repeats. The PROBE reaction yields products amenable to analysis by relatively low performance ESI or MALDI instrumentation; direct measurement of short amplified product masses is an even more direct means of mutation detection,and will likely become more widespread with the increasing interest in high performance MS available with FTMS.

EXAMPLE 16

[0591] Immobilization of nucleic acids on solid supports via an acid-labile covalent bifunctional trityl linker

[0592] Aminolinked DNA was prepared and purified according to standard methods. A portion (10eq) was evaporated to dryness on a speedvac and suspended in anhydrous DMF/pyridine (9:1; 0.1 ml). To this was added the chlorotrityl chloride resin (1 eq, 1.05 μmol/mg loading) and the mixture was shaken for 24 hours. The loading was checked by taking a sample of the resin, detritylating this using 80% AcOH, and measuring the absorbance at 260 nm. Loading was ca. 150pmol/mg resin.

[0593] In 80% acetic acid, the half-life of cleavage was found to be substantially less than 5 minutes—this compares with trityl ether-based approaches of half-lives of 105 and 39 minutes for para and meta substituted bifunctional dimethoxytrityl linkers respectively. Preliminary results have also indicated that the hydroxy picolinic acid matrix alone is sufficient to cleave the DNA from the chlorotrityl resin.

EXAMPLE 17

[0594] Immobilization of nucleic acids on solid supports via hydrophobic trityl linker

[0595] The primer contained a 5′-dimethoxytrityl group attached using routine trityl-on DNA synthesis.

[0596] Cl8 beads from an oligo purification cartridge (0.2 mg) placed in a filter tip was washed with acetonitrile, then the solution of DNA (50 ng in 25 μl) was flushed through. This was then washed with 5% acetonitrile in ammonium citrate buffer (70 mM, 250 μl). To remove the DNA form the Cl8, the beads were washed with 40% acetonitrile in water (10 μl) and concentrated to ca 2 μl on the Speedvac. The sample was then submitted to MALDI.

[0597] The results showed that acetonitrile/water at levels of ca.>30% are enough to dissociate the hydrophobic interaction. Since the matrix used in MALDI contains 50% acetonitrile, the DNA can be released from the support and successfully detected using MALDI-TOF MS (with the trityl group removed during the MALDI process).

[0598]FIG. 69 is a schematic representation of nucleic acid immobilization via hydrophobic trityl linkers.

EXAMPLE 18

[0599] Immobilization of nucleic acids on solid supports via Streptavidin-Iminobiotin

[0600] Experimental Procedure

[0601] 2-iminobiotin N-hydroxy-succinimid ester (Sigma) was conjugated to the oligonucleotides with a 3′- or 5-′amino linker following the conditions suggested by the manufacturer. The completion of the reaction was confirmed by MALDI-TOF MS analysis and the product was purified by reverse phase HPLC.

[0602] For each reaction, 0.1 mg of streptavidin-coated magnetic beads (Dynabeads M-280 Streptavidin from Dynal) were incubated with 80 pmol of the corresponding oligo in the presence of 1M NaCl and 50 mM ammonium carbonate (pH 9.5) at room temperature for one hour. The beads bound with oligonucleotides were washed twice with 50 mM ammonium carbonate (pH 9.5). Then the beads were incubated in 2 μl of 3-HPA matrix at room temperature for 2 min. An aliquot of 0.5 μl of supernatant was applied to MALDI-TOF. For biotin displacement experiment, 1.6. mol of free biotin (80-fold excess to the bound oligo) in 1 μl of 50 mM ammonium citrate was added to the beads. After a 5 min. incubation at room temperature, 1 μl of 3-HPA matrix was added and 0.5 μl, of supernatant was applied to MALDI-TOF MS. To maximize the recovery of the bound iminobiotin oligo, the beads from the above treatment were again incubated with a 2 μl of 3-HPA matrix and 0.5 μl of supernatant was applied to MALDI-TOF MS. The matrix alone and free biotin treatment quantitatively released iminobiotin oligo off the streptavidin beads as shown in FIGS. 70 and 71.

EXAMPLE 19

[0603] Mutation Analysis Using Loop Primer Oligo Base Extension MATERIALS AND METHODS

[0604] Genomic DNA. Genomic DNA was obtained from healthy individuals and patients suffering from sickle cell anemia. The wildtype and mutated sequences have been evaluated conventionally by standard Sanger sequencing.

[0605] PCR-Amplification. PCR amplifications of a part of the β-globin was established and optimized to use the reaction product without a further purification step for capturing with streptavidin coated bead. The target amplification for LOOP-PROBE reactions were performed with the loop-cod5 d(GAG TCA GGT GCG CCA TGC CTC AAA CAG ACA CCA TGG CGC, SEQ ID No. 58) as forward primer and β-11-bio d(TCT CTG TCT CCA CAT GCC CAG, SEQ ID. No. 59) as biotinylated reverse primer. The underlined nucleotide in the loop-cod5 primer is mutated to introduce an invariant Cfol restriction site into the amplicon and the nucleotides in italics are complementary to a part of the amplified product. The total PCR volume was 50p1 including 200 ng genomic DNA, 1U Taq-polymerase (Boehringer-Mannheim, Cat# 1596594), 1.5 mM MgCl₂, 0.2 mM dNTPs (Boehringer-Mannheim, Ca# 1277049), and 10 pmol of each primer. A specific fragment of the β-globin gene was amplified using the following cycling condition: 5 min 94° C. followed by 40 cycles of : 30 sec @ 94° C., 30 sec @ 56° C, 30 sec @ 72° C., and a final extension of 2 min at 72° C.

[0606] Capturing and denaturation of biotinylated templates. 10 μl paramagnetic beads coated with streptavidin (10 mg/ml; Dynal, Dynabeads M-280 streptavidin Cat# 11 2.06) and treated with 5×binding solution (5 M NH₄Cl, 0.3M NH₄OH) were added to 40 μl PCR volume (10 μl of the amplified product was saved for check electrophoresis). After incubation for 30 min at 37° C. the supernatant was discarded. The captured templates were denatured with 50 μl 100 mM NaOH for 5 min at ambient temperature, then washed once with 50 μl 50 mM NH₄OH and three times with 100 μl 10 mM Tris.Cl, pH 8.0. The single stranded DNA served as templates for PROBE reactions.

[0607] Primer oligo base extension (PROBE) reaction. The PROBE reactions were performed using Sequenase 2.0 (USB Cat# E70775Z including buffer) as enzyme and dNTPs and ddNTPs supplied by Boehringer-Mannheim (Cat# 1277049 and 1008382). The ratio between dNTPs (dCTP, dGTP, dTTP) and ddATP was 1:1 and the total used concentration was 50 μM of each nucleotide. After addition of 5 μl -fold Sequenase-buffer the beads were incubated for 5 min at 65° C. and for 10 min at 37° C. During this time the partially self complementary primer annealed with the target site. The enzymatic reaction started after addition of 0.5 μl 100 mM dithiothreitol (DTT), 3.5 μl dNTP/ddNTP solution, and 0.5 μl Sequenase (0.8 U) and incubated at 37° C. for 10 min. Hereafter, the beads were washed once in 1-fold TE buffer (10 mM Tris, 1 mM EDTA, pH 8.0).

[0608] Cfol restriction digest. The restriction enzyme digest was performed in a total volume of 5 μl using 10 U Cfol in 1 -fold buffer L purchased from Boehringer-Mannheim. The incubation time was 20 min at 37° C.

[0609] Conditioning of the diagnostic products for mass spectrometric analysis

[0610] After the restriction digest, the supernatant was precipitated in 45 μl H₂O, 10 μl 3M NH₄- acetate (pH 6.5), 0.5 μl glycogen (10 mg/ml in water, Sigma, Cat# G1765), and 110 μl absolute ethanol for 1 hour at room temperature. After centrifugation at 13,000 g for 10 min the pellet was washed in 70% ethanol and resuspended in 2 μl 18 Mohm/cm H₂O. The beads were washed in 100 μl 0.7 M NH⁴⁻ citrate followed by 100 μl 0.05 M NH⁴⁻ citrate. The diagnostic products were obtained by heating the beads in 2 μl 50 mM NH₄OH at 80° C. for 2 min.

[0611] Sample preparation and analysis on MALDI-TOF mass spectrometry.

[0612] Same preparation was performed by mixing 0.6 μl of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic ammonium citrate in 1:1 H₂O:CH₃CN) with 0.3 μl of either resuspended DNA/glycogen pellet or supernatant after heating the beads in 50 mM NH₄OH on a sample target and allowed to air dry. The sample target was automatically introduced in to the source region of an unmodified Perspective Voyager MALDI-TOF operated in delayed extraction linear mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical molecular mass (M_(r)(calc)) were calculated from atomic compositions; reported experimental (M_(r)(exp)) values are those of the singly-protonated form.

[0613] RESULTS

[0614] The LOOP-PROBE has been applied to the detection of the most common mutation of codon 6 of the human β-globin gene leading to sickle cell anemia. The single steps of the method are schematically presented in FIG. 72. For the analysis of codon 6, a part of the β-globin gene was amplified by PCR using the biotinylated reverse primer β11 bio and the primer loop-cod5 which is modified to introduce a Cfol recognition site (FIG. 72a). The amplified product is 192 bp in length. After PCR the amplification product was bound to streptavidin coated paramagnetic particles as described above. The antisense strand was isolated by denaturation of the double stranded amplified product (FIG. 72b). The intra-molecule annealing of the complementary 3′ end was accomplished by a short heat denaturation step and incubation at 37° C. The 3′ end of the antisense strand is now partially double stranded (FIG. 72c). For analyzing the DNA downstream of the self annealed 3′-end of the antisense strand, the primer oligo base extension (PROBE) has been performed using ddATP, dCTP, dGTP, dTTP (FIG. 72d). This generates different products in length specific for the genotype of the analyzed individual. Before the determination of the length of these diagnostic products, the DNA was incubated with the Cfol restriction endonuclease that cuts 5′ of the extended product. This step frees the stem loop from the template DNA whereas the extended product still keeps attached to the template. The extended products are then denatured by heating from the template stand and analyzed by MALDI-TOF mass spectrometry.

[0615] Since the MALDI-TOF analyses were performed with a non-calibrated instrument, the mass deviation between observed and expected values was approximately 0.6% higher than theoretically calculated. Nevertheless, the results obtained were conclusive and reproducible within repeated experiments. In all analyzed supernatants after the restriction digest the stem loop could be detected. Independent of the genotype, the stem loop has had in all analyses molecular masses about 8150 Da (expected 8111 Da). An example is shown in FIG. 73a. The second peak in this figure with a mass of 4076 Da is a doubly charged ion of the stem loop. FIG. 73b to 73 d show the analyses of different genotypes as indicated in the respective inserts. HbA is the wildtype genotype and HbC and HbS are two different mutations in codon 6 of the β-globin gene which cause sickle cell disease. In the wildtype situation a single peak with a molecular mass of 4247 Da and another with 6696 Da are detected (FIG. 73b). The latter corresponds to the biotinylated PCR primer (β-11-bio) unused in the PCR reaction which also has been removed in some experiments. The former corresponds to the diagnostic product for HbA. The analyses of the two individual DNA molecules with HbS trait as well as compound heterozygosity (HbS/HbC) for the sickle cell disorder lead also to unambiguous expected results (FIG. 73c and 73 d).

[0616] In conclusion, the LOOP-PROBE is a powerful means for detection of mutations especially predominant disease causing mutations or common polymorphisms. The technique eliminates one specific reagent for mutation detection and, therefore, simplifies the process and makes it more amenable to automation. The specific extended product that is analyzed is cleaved off from the primer and is therefore shorter compared to the conventional method. In addition, the annealing efficiency is higher compared to annealing of an added primer and should therefore generate more product. The process is compatible with multiplexing and various detection schemes (e.g., single base extension, oligo base extension and sequencing). For example, the extension of the loop-primer can be used for generation of short diagnostic sequencing ladders within highly polymorphic regions to perform, for example, HLA typing or resistance as well as species typing (e.g., Mycobacterium tuberculosis)).

EXAMPLE 20

[0617] T7-RNA Polymerase Dependent Amplification of CKR-5 and Detection by MALDI-MS

[0618] MATERIALS AND METHODS

[0619] Genomic DNA. Human genomic DNA was obtained from healthy individuals.

[0620] PCR-Amplification and Purification. PCR amplification of a part of the CKR-5 gene was accomplished using ckrT7f as sense primer d(A CC TAG CGT TCA GTT CGA CTG AGA TAA TAC GAC TCA CTA TAG CAG CTC TCA TTT TCC ATA C (SEQ ID NO. 60). The underlined sequence 25 corresponds to the sequence homologous to CKR-5, the bolded sequence corresponds to the T7-RNA polymerase promoter sequence and the italic sequence was chosen randomly. ckr5r was used as antisense primer d(AAC TAA GCC ATG TGC ACA ACA (SEQ ID NO. 61). Purification of the amplified product and removal of unincorporated nucleotides was carried out using the QlAquick purification kit (Qiagen, cat# 28104). In the final PCR volume of 50 μl were 200 ng genomic DNA, 1U Taq-polymerase (Boehringer-Mannheim, cat# 1596594), 1.5 mM MgCl₂ 0.2 mM dNTPs (Boehringer-Mannheim, cat# 1277049), and 10 pmol of each primer. The specific fragment of the CKR-5 gene was amplified using the following cycling conditions: 5 min @ 94° C. followed by 40 cycles of 45 sec @ 94° C., 45 sec 52° C., 5 sec @ 72° C, and a final extension of 5 min at 72° C.

[0621] T7-RNA Polymerase conditions. One third of the purified DNA (about 60ng) was used in the T7-RNA polymerase reaction. (Boehringer-Mannheim, cat# 881 767). The reaction was carried out for 2h at 37° C. according to the manufacturer's conditions using the included buffer. The final reaction volume was 20 μl 0.7 μl RNasin (33 U/μl) had been added. After the extension reaction, the enzyme was inactivated by incubation for 5 min at 65° C.

[0622] DNA digestion and conditioning of the diagnostic products for mass spec analysis.

[0623] The template DNA was digested by adding RNase-free DNase I (Boehringer-Mannheimn, cat# 776 758) to the inactivated T7 mixture and incubation for 20 min at room temperature. Precipitation was carried out by adding 1 μl glycogen (10 mg/ml, Sigma, cat# G1765), {fraction (1/10)} volume 3M NH²⁻ acetate (pH 6.5), and 3 volume absolute ethanol and incubation for 1 hour at room temperature. After centrifugation at 13,000 g for 10 min, the pellet was washed in 70% ethanol and resuspended in 3 μl 18 Mohm/cm H₂O. 1 μl was analyzed on an agarose gel.

[0624] Sample preparation and analysis on MALDI-TOF mass spectrometry

[0625] Sample preparation was performed by mixing 0.6 μl of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic ammonium citrate in 1:1 H₂O:CH₃CN) with 0.3 μl of resuspended DNA/glycogen on a sample target and allowed to air dry. The sample target was introduced into the source region of an unmodified Finnigan VISION2000 MALDI-TOF operated in relectron mode with 5 kV. The theoretical molecular mass was calculated form atomic composition; reported experimental values are those of singly-pronated form.

[0626] RESULTS

[0627] The chemokine receptor CKR-5 has been identified as a major coreceptor in HIV-1 (see e.g., WO 96/39437 to Human Genome Sciences; Cohen, J. etal. Science 275:1261). A mutant allele that is characterized by a 32 bp deletion is found in 1 6% of the HIV-1 seronegative population whereas the frequency of this allele is 35% lower in the HIV-1 seropositive population. It is assumed that individuals homozygous for this allele are resistant to HIV-1. The T7-RNA polymerase dependent amplification was applied to identify this specific region of the chemokine receptor CKR-5 (FIG. 74). Human genomic DNA was amplified using conventional PCR. The sense primer has been modified so that it contains a random sequence of 24 bases that facilitate polymerase binding and the T7-RNA polymerase promoter sequence (FIG. 75). The putative start of transcription is at the first base 5′ of the promoter sequence. ckr5r was used as an antisense primer. PCR conditions are outlined above. The amplified product derived from wildtype alleles is 75 bp in length. Primer and nucleotides were separated from the amplification product using the Qiagen QlAquick purification kit. One third of the purified product was applied to in vitro transcription with T7-RNA polymerase. To circumvent interference of the template DNA, it was digested by adding RNase-free DNase I. RNA was precipitated and this step also leaves the degraded DNA in the supernatant. Part of the redissolved RNA was analyzed on an agarose gel and the rest of the sample was prepared for MALDI-TOF analysis. The expected calculated mass of the product is 24560 Da. A dominant peak, that corresponds to an approximate mass of 25378.5 Da can be observed. Since the peak is very broad, an accurate determination of molecular mass was not possible. The peak does not correspond to residual DNA template. First, the template DNA is digested, and second, the DNA strands would have a mass of 23036.0 and 23174 Da, respectively.

[0628] This example shows that T7 RNA polymerase can effectively amplify target DNA. The generated RNA can be detected by Mass spectrometry. In conjunction with modified (e.g., 3′-deoxy)ribonucleotides that are specifically incorporated by a RNA polymerase but not extended any further, this method can be applied to determine the sequence of a template DNA.

EXAMPLE 21

[0629] MALDI Mass Spectrometry of RNA Endonuclease Digests MATERIALS

[0630] Synthetic RNA (Sample A:5′-UCCGGUCUGAUGAGUCCGUGAGGAC-3′ (SEQ ID 62); sample B:5′-GUCACUACAGGUGAGCUCCA-3′ (SEQ ID NO 63); sample C:5′-CCAUGCGAGAGUAAGUAGUA-3′ (SEQ ID NO. 64)) samples were obtained from DNA technology (Aahus, Denmark) and purified on a denaturing polyacrylamide gel (Shaler, T. A. etal. (1996) Anal. Chem. 63:5766-579). Rnases T₁ (Eurogentec), U₂ (Calbiochem), A (Boehringer-Mannheim) and PhyM (Pharmacia) were used without additional purification. Streptavidin-coated magnetic beads (Dynabeads M-280 Streptavidin, Dynal) were supplied as a suspension of 6-7×10⁸ bead/ml (10 mg/ml) dissolved in phosphate-buffered saline (PBS) containing 0.1 % BSA and 0.02% NaN₃. 3-Hydroxypicolinic acid (3-HPA) (Aldrich) was purified by a separate desalting step before use as described in more detail elsewhere (Little, D. P. et al (1995) Proc. Natl. Acad. Sci. U.S.A. 92, 2318-2322).

[0631] METHODS

[0632] In vitro transcription reaction. The 5′-biotinylated 49 nt in vitro transcript (SEQ ID No. 65): AGGCCUGCGGCAAGACGGAAAGACCAUGGUCCCUNAUCUGCCGCAGGAUC was produced by transcription of the plasmid pUTMS2 (linearized with the restriction enzyme BamHI) with T7 RNA polymerase (Promega). For the transcription reaction 3 μg template DNA and 50u T7 RNA polymerase were used in a 50 μl volume of 1 u/μl RNA guard (Rnax inhibitor, Pharmacia), 0.5 mM NTP's 1.0 mM 5′-biotin-ApG dinucleotide, 40 mM Tris-HCl (pH 8.0), 6 mM MgCl₂ 2 mM spermidine and 10 mM DTT. Incubation was performed at 37° C. for 1 hour, then another aliquot of 50 units T7 RNA polymerase was added and incubation was continued for another hour. The mixture was adjusted to 2M NH⁴⁻ acetate and the RNA was precipitated by addition of one volume of ethanol and one volume of isopropanol. The precipitated RNA was collected by centrifugation at 20,000 X g for 90 min at 4° C., the pellet was washed with 70% ethanol, dried and redissolved at 8 M urea. Further purification was achieved by electrophoresis through a denaturing polyacrylamide gel as described elsewhere (Shaler, T. A. et al. (1996) Anal. Chem. 68:576-579). The ration of 5′-biotinylated to non-biotinylated transcripts was about 3:1.

[0633] Ribonuclease assay. For partial digestion with selected RNases different enzyme concentrations ad assay conditions were employed as summarized in table VII. The solvents for each enzyme were selected following the suppliers' instructions. The concentrations of the synthetic RNA samples and the in vitro transcript were adjusted to 5-10×10⁻⁶M. TABLE VII Overview and Assay Conditions of the RNAses Concentration Incubation Time [units Rnase/ (max. number Rnase Source μgRNA] Conditions of fragments] References T₁ Aspergillus oryzae 0.2 20 mM Tris-HCl,  5 min. Donis-Keller, H. et al., (1977) pH 5.7, 37° C. Nuc. Acids Res. 4:2527-2537 U₂ Ustilago 0.01 20 mM DAC, pH 5.0, 15 min Donis-Keller, H. et al., (1977) Sphaerogena 37° C. Nuc. Acids Res. 4:2527-2537 PhyM Physarum 20 20 mM DAC, pH 5.0, 15 min Donis-Keller, H. et al., (1980) polycephalu m 50° C. Nucl. Acids Res. 8:3133-3142 A bovine pancrease 4 × 10⁻⁹ 10 mM Tris-HCl, pH 30 min Breslow, R. and R. Xu. 7.5, 37° C. (1993) Proc. Natl. Acad. Sci. USA 90:1201-1207 CL₃ chicken liver 0.01 10 mM Tris-HCl, pH 30 min Boguski, et al., (1980) J. 6.5, 37° C. Biol. Chem. 255:2160-2163 cusativin cucumis sativus L. 0.05 ng 10 mM Tris-HCl, pH 30 min Rojo, M.A. et al. (1994) 6.5, 37° C. Planta 194:328-338

[0634] The reaction was stopped at selected times by mixing 0.6 μl aliquots of the assay with 1.5 μl of 3 HPA-solution. The solvent was subsequently evaporated in a stream of cold air for the MALDI-MS analysis.

[0635] Limited alkaline hydrolysis was performed by mixing equal volumes (2.0 μl) of 25% ammonium hydroxide and RNA sample (5-10×10⁻⁶M) at 60° C. 1 μl aliquots were taken out at selected times and dried in a stream of cold air. For these samples it turned out to be important to first dry the digests in a stream of cold air, before 1.5 μl of the matrix solution and 0.7 μl of NH₄+ loaded cation exchanged polymer beads were added.

[0636] The reaction was stopped at selected times by mixing 0.6 μl aliquots of the assay with 1.5 μl of 3HPA-solution. The solvent was subsequently evaporated in a stream of cold air for the MALDI-MS analysis.

[0637] Limited alkaline hydrolysis was performed by mixing equal volumes (2.0 μl) of 25% ammonium hydroxide and RNA sample (5-10×10⁶M) at 60° C. 1 μl aliquots were taken out at selected times and dried in a stream of cold air. For these samples it turned out to be important to first dry the digests in a stream of cold air, before 1.5 μl of the matrix solution and 0.7 μl if a suspension of NH₄+ loaded cation exchange polymer beads were added.

[0638] Separation of 5′-biotinylated fragments. Steptavidin-coated magnetic beads were utilized to separate 5′-biotinylated fragments of the in vitro transcript after partial RNase degradation. The biotin moiety in this sample was introduced during the transcription reaction initiated by the 5′-biotin-pApG-dinucleotide. Prior to use, the beads were washed twice with 2×binding & washing (b&w) buffer (20 mM Tris-HCl, 2 mM EDTA, 2 M NaCl pH 8.2) and resuspended at 10 mg/ml in 2×b&w buffer. Circa 25 pmol of the RNA in vitro transcript were digested by RNase U2 using the protocol described above. The digestion was stopped by adding 3 μl of 95% formamide containing 10 mM trans- 1,2-diaminocyclohexane-N,N,N¹,N¹-tetraacetic acid (CDTA) at 90° C. for 5 min, followed by cooling on ice. Subsequently, capture of the biotinylated fragments was achieved by incubation of 6 μl of the digest with 6 μl of the bead suspension and 3 μl of b&w buffer at room temperature for 15 min. Given the binding capacity of the beads of 200 pmol of biotinylated oligonucleotide per mg of beads, as specified by the manufacturer, the almost 2-times excess of oligonucleotide was used to assure a full loading of the beads. The supernatant was removed, and the beads were washed twice with 6 μl of H₂O. The CDTA and 95% formamide at 90° C. for 5 min. After evaporation of the solvent and the formamide the <2.5 pmol of fragments were resuspended in 2 μl H₂O and analyzed by MALDI-MS as described above.

[0639] Sample preparation for MALDI-MS. 3-Hydroxypicolinic acid (3-HPA) was dissolved in ultra pure water to a concentration of ca. 300 mM. Metal cations were exchanged against NH₄+as described in detail previously. (Little, D. P. et al. (1995) Proc. Natl. Acad. Sci. U.S.A. 92: 2318-2322). Aliquots of 0.6 μl of the analyte solution were mixed with 1.5 μl 3-HPA on a flat inert metal substrate. Remaining alkali cations, present in the sample solution as well as on the substrate surface, were removed by the addition of 0.7 μl of the solution of NH₄ ⁺—loaded cation exchange polymer beads. During solvent evaporation, the beads accumulated in the center of the preparation, were not used for the analysis, and were easily removed with a pipette tip.

[0640] Instrument. A prototype of the Vision 2000 (ThermBioanalysis, Hemel, Hempstead, UK) reflectron time of flight mass spectrometer was used for the mass spectrometry. Ions were generated by irradiation with a frequency-tripled ND:YAG laser (355 nm, 5 ns; Spektrum GmbH, Berlin, Germany) and accelerated to 10 ke V. Delayed ion extraction was used for the acquisition of the spectra shown, as it was found to substantially enhance the signal to noise ratio and/or signal intensity. The equivalent flight path length of the system is 1.7 m, the base pressure is 10⁻⁴ Pa. Ions were detected with a discrete dynode secondary-electron multiplier (R2362, Hamamatsu Photonics), equipped with a conversion dynode for effective detection of high mass ions. The total impact energy of the ions on the conversion dynode was adjusted to values ranging from 1 6 to 25 keV, depending on the mass to be detected. The preamplified output signal of the SEM was digitized by a LeCroy 9450 transient recorder (LeCroy, Chestnut Ridge, N.Y., USA) with a sampling rate of up to 400 MHz. For storage and further evaluation, the data were transferred to a personal computer equipped with custom- made software (ULISSES). All spectra shown were taken in the positive ion mode. Between 20 and 30 single shot spectra were averaged for each of the spectra shown.

[0641] RESULTS

[0642] Specificity of Rnases. Combining base-specific RNA cleavage with MALDI-MS requires reaction conditions optimized to retain the activity and specificity of the selected enzymes on the one hand and complying with the boundary conditions for MALDI on the other. Incompatibility mainly results because the alkaline-ion buffers, commonly used in the described reaction, such as Na-phosphate, Na-citrate or Na-acetate as well as EDTA interfere with the MALDI sample preparation; presumably they disturb the matrix crystallization and/or analyte incorporation. Tris-HCl or ammonium salt buffers, in contrast, are MALDI compatible (Shaler, T. A. et al. (1996) Anal. Chem. 68:576-579). Moreover, alkaline salts in the sample lead to the formation of a heterogenous mixture of multiple salts of the analyte, a problem increasing with increasing number of phosphate groups. Such mixtures result in loss of mass resolution and accuracy as well as signal-to-noise ratio (Little, D. P. et al (1995) Proc. Natl. Acad. Sci. U.S.A. 92:2318-2322; Nordhoff, E., Cramer, R. Karas, M., Hillenkamp, F., Kirpekar, F., Kristiansen, K. and Roepstorff, P. (1993) Nucleic Acids Res., 21, 3347-3357). Therefore, RNase digestions were carried out under somewhat modified conditions compared to the ones described in the literature. They are summarized above in table VII. For Rnase T₁, A, CL₃ ad Cusativin, Tris-HCI (pH 6-7.5) was used as buffer. 20 mM DAC provides the pH of 5, recommended for maximum activity of RNases U₂ and PhyM. The concentration of 10-20 mM of these compounds were found to not interfere significantly with the MALDI analysis. To examine the specificity of the selected ribonucleases under these conditions, three synthetic 20-25mer RNA molecules with different nucleotide sequences were digested.

[0643] The MALDI-MS spectra of FIG. 77 shows five different cleavage patterns (A-E) of a 25 nt RNA obtained after partial digestion with RNases T₁, U₂, PhyM, A, and alkaline hydrolysis. These spectra were taken from aliquots which were removed from the assay after empirically determined incubation times, chosen to get an optimum coverage of the sequence. As the resulting samples were not fractionated prior to mass spectrometric analysis, they contain all fragments generated at that time by the respective RNases. In practice, uniformity of the cleavages, can be affected by a preferential attack on the specific phophodiester bonds (Donis-Keller, H., Maxam, A. M., and Gilbert, W. (1977) Nucleic Acids Res., 4, 1957-1978; Donis-Keller, H. (1980) Nucleic Acids Res., 8 3133-3142). The majority of the expected fragments are indeed observed in the spectra. It is also worth noting that for the reaction protocols as used, correct assignment of all fragment masses is only possible, if a 2′, 3′-cyclic phosphate group is assumed. It is well known that such cyclic phosphates are intermediates in the cleavage reaction and get hydrolyzed in a second, independent and slower reaction step involving the enzyme (Richards, F. M., and Wycoff, H. W. in The Enzymes Vol. 4, 3rd Ed., (ed. Boyer, P. D.) 746-806 (1971, Academic Press, New York); Heinemann, U and W. Saenger (1985) Pure Appl. Chem. 57, 417-422; Ikehara, M. et al., (1987) Pure Appl. Chem. 59-965-968) Vreslow, R. and Xu, R. (1993) Proc. Nal. Acad. Sci. USA, 90, 1201-1207). In a few cases different fragments have equal mass of differ by as little as 1 Dalton., In these cases, mass peaks cannot unambiguously be assigned to one or the other fragments. Digestion of two additional different 20 nt RNA samples was, therefore, performed (Hahner, S., Kirpekar, F., Nordoff, E., Kristiansen, K., Roepstorff, P. and Hillenkamp, F. (1996) Proceedings of the 44th ASMS Conference on Mass Spectrometry, Portland, Oreg.) in order to sort out these ambiguities. For all samples tested, the selected ribonucleases appear to cleave exclusively at the specified nucleotides leading to fragments arising from single as well as multiple cleavages.

[0644] In FIG. 77, peaks, indicating fragments containing the original 5′-terminus, are marked by arrows. All non marked peaks can be assigned to internal sequences or those with retained 3′-terminus. For a complete sequence all possible fragments bearing exclusively either the 5′- or the 3′-terminus of the original RNA would suffice. In practice, the 5′-fragments are better suited for this purpose, because the spectra obtained after incubation of all three synthetic RNA samples contain the nearly complete set of originals of 5′-ions for all different RNases (Hahner, S., Kirpekar, F., Nordoff, E., Kristiansen, K., Roepstorff and Hillenkamp, F. (1996) Proceedings of the 44th ASMS Conference on Mass Spectrometry, Portland, Oreg.). Internal fragments are somewhat less abundant and fragments containing the original 3′-terminus appear suppressed in the spectra. In agreement with observations reported in the literature (Gupta, R. C. and Randerath, K. (1977) Nucleic Acids Res., 4, 1957-1978), cleavages close to the 3′-terminus were partially suppressed in partial digests of the RNA 25 mer by RNase T₁, and U₂ (even if they are internal or contain the original 5′-terminus). Fragments from such cleavages appear as weak and poorly resolved signals in the mass spectra.

[0645] For larger RNA molecules secondary structure is known to influence the uniformity of the enzymatic cleavages (Donis-Keller, H., Maxam, A. M. and Gilbert, A. (1977) Nucleic Acids Res. 8, 3133-3142). This can, in principle be, overcome by altered reaction conditions. In assay solutions containing 5-7 M urea, the activity of RNases such as T₂, U₂, A, Cl₃, and PhyM is known to be retained (Donis-Keller, H., Maxam, A. M. and Gilbert, W. (1977) Nucleic Acids Res., 4, 2527-2537; Boguski, M. S., Hieter, P. A., and Levy, C. C. (1980) J. Biol. Chem.,255, 2160-2163; Donis-Keller, H. (1980) Nucleic Acids Res., 8, 3133-3142, while RNA is sufficiently denatured. UV-MALDI-analysis with 3-HPA as matrix is not possible under such high concentrations of urea in the sample. Up to a concentration of 2 M urea in the reaction buffer, MALDI analysis of the samples was still possible although significant changes in matrix crystallization were observed. Spectra of the RNA 20 mer (sample B), digested in the presence of 2 M urea still resembled those obtained under conditions listed in Table VII.

[0646] Digestion by RNases which exclusively recognize one nucleobase is desirable to reduce the complexity of the fragment patterns and thereby facilitate the mapping of the respective nucleobase. RNases CL₃ and cursavitin are enzymes reported to cleave at cytidylic acid residues. Upon limited RNase CL₃ and cursativin digestion of the RNA-20mer (sample B) under non-denaturing conditions, fragments corresponding to cleavages at cytidylic residues were indeed observed (FIG. 78). Similar to the data reported so far (Boguski, M. S., Hieter, P. A. and Levy, C. C. (1980) J. BioL Chem., 255, 2160-2163: Rojo, M. A., Arias, F. J., Iglesias, R., Ferreras, J. M., Munoz, R., Escarmis, C., Soriano, F., Llopez-Fando, Mendez, E., and Girbes, T. (1994) Planta, 194, 328-338). The degradation pattern in FIG. 78, however, reveals that not every cytidine residue is recognized, especially for neighboring C residues. RNase CL₃ is also reported to be susceptible to the influence of secondary structure (Boguski, M. S., Heiter, P. A., and Levy, C. C. (1980) J Biol. Chem., 255, 2160-2163), but for RNA of the size employed in this study, such an influence should be negligible. Therefore, unrecognized cleavage sites in this case can be attributed to a lack of specificity of this enzyme. To confirm these data, a further RNase CL₃-digestion was performed with the RNA 20mer (sample C). As a result of the sequence of this analyte, all three linkages containing cytidylic acid were readily hydrolyzed, but additional cleavages at uridylic acid residues were detected as well. Since altered reaction conditions such as increased temperature (90° C.), various enzyme to substrate ratios, and addition of 2M urea did not result in a digestion of the expected specificity, application of this enzyme to sequencing was not pursued further. Introduction of a new cytidine-specific ribonuclease, cusativin, isolated form dry seeds of Cucumis sativus L. looked promising for RNA sequencing (Rojo, M. A., Arias, F. J., Iglesias, R., Ferreras, J. M., Munoz, R., Escarmis, C., Soriano, F., Llopez-Fando, J., Mendez, E. and Girbes, T. (1994) Planta, 194, 328-338). As shown in FIG. 78, not every cytidine residue was hydrolyzed and additional cleavages occurred at uridylic acid residues for the recommended concentration of the enzyme. RNases CL₃ and cusativin will, therefore not yield the desired sequence information for mapping of cytidine residues and their use was not further pursued. The distinction of pyrimidine residues can be achieved, however, by use of RNases with multiple specificities, such as Physarum polycephalum RNase (cleaves ApN, UpN) and pancreatic RNase A (cleaves UpN, CpN) (see FIG. 77). All 5′-terminus fragments, generated by the monospecific RNase U₂ and apparent in the spectrum of FIG. 77C were also evident in the spectrum of the RNase PhyM digest (FIG. 77D). Five of the six uridilic cleavage sites could, this way, be uniquely identified by this indirect method. In a next step, the knowledge of the uridine cleavage sites was used to identify sites of cleavage of cytidilic acid residues in the spectrum recorded after incubation with RNase A (FIG. 77E), again using exclusively ions containing the original 5′-terminus. Two of the four expected cleavage sites were identified this way. A few imitations are apparent from these spectra, if only the fragments containing the original 5′-terminus are used for the sequence determination. The first two nucleotides usually escape the analysis, because their signals get lost in the low mass matrix background. Because of this, the corresponding fragments are missing in the spectra of the U- and C-specific cleavages. Large fragments with cleavage sites close to the 3′-terminus are often difficult to identify, particularly in digests with RNases T^(1,) and U₂, because of their low yield (vide supra) and the often strong nearby signal of the non-digested transcript. Accordingly the cleavages in position 22 and 23 do not show up in the spectrum of the G-specific RNase T, (FIG. 77A) and the cleavage site 24 cannot be identified from the spectra of the U₂ and PhyM digests (FIGS. 77 C and D). Also site 16 and 17 with two neighboring cytidilic acids cannot be identified in the RNase A spectrum of FIG. 77E. These observations demonstrate that a determination of exclusively the 5′-terminus fragments may not always suffice and the information contained in the internal fragments may be needed for a full sequence analysis.

[0647] Finally, limited alkaline hydrolysis provides a continuum of fragments (FIG. 77B), which can be used to complete the sequence data. Again, the spectrum is dominated by ions of fragments containing the 5′-terminus, although the hydrolysis should be equal for all phosphodiester bonds. As was true for the enzymatic digests, correct mass assignments requires one to assume that all fragments have a 2′, 3′-cyclic phosphate. The distribution of peaks, therefore, resembles that obtained after a 3′-exonuclease digest (Pieles, U., Zurcher, W., Schar, M. and Moser, H. E., (1993) Nucleic Acids Res., 21, 3191-3196; Nordhoff, E. et al. (1993) Book of Abstracts, 13^(th) Internat. Mass Spectrom. Conf., Budapest p. 218; Kirpekar, F., Nordhoff, E., Kristiansen, K., Roepstorff, P., Lezius, A. Hahner, S., Karas, M. and Hillenkamp. F. (1994) Nucleic Acid Res., 22, 3866-3870). In principle, the alkaline hydrolysis alone could, therefore, be used for a complete sequencing. This is, however, only possible for quite small oligoribonucleotides, because larger fragment ions, differing in mass by only a few mass units will not be resolved in the spectra and the mass of larger ions cannot be determined with the necessary accuracy of better than 1 Da, even if peaks are partially or fully resolved. The interpretation of the spectra particularly from digests of unknown RNA samples is substantially simplified, if only the fragments containing the original 5′-terminus are separated out prior to the mass spectrometric analysis. A procedure for this approach is described in the following section.

[0648] Separation of 5′-biotinylated fragments. Streptavidin-coated magnetic beads (Dynal) were tested for the extraction of fragments containing the original 5′-terminus from the digests. Major features to be checked for this solid-phase approach are the selective immobilization and efficient elution of biotinylated species. In preliminary experiments, a 5′-biotinylated DNA (19 nt) and streptavidin were incubated and MALDI analyzed after standard preparation. Despite the high affinity of the streptavidin-biotin interaction, the intact complex was not found in the MALDI spectra. Instead, signals of the monomeric subunit of streptavidin and the biotinylated DNA were detected. Whether the complex dissociates in the acidic matrix solution (pKA 3) or during the MALDI desorption process, is not known. Surprisingly, if the streptavidin is immobilized on a solid surface such as magnetic beads, the same results are not observed. A mixture of two 5′-biotinylated DNA samples (19 nt and 27 nt) and two unlabeled DNA sequences (12 nt and 22 nt) were incubated with the beads. The beads were extracted and carefully washed before incubation in the 3-HPA MALDI matrix. No analyte signals could be obtained from these samples. To test whether the biotinylated species had been bound to the beads altogether, elution form the extracted and washed beads was performed by heating at 90° C. in the presence of 95% formamide. This procedure is expected to denature the streptavidin, thereby breaking the streptavidin/biotin complex. FIG. 79B shows the expected signals of the two biotinylated species, proving that release of the bound molecules in the MALDI process is the problem rather than the binding of the beads; FIG. 79A shows a spectrum of the same sample after standard preparation, showing signals of all four analytes as a reference. Complete removal of the formamide after the elution and prior to the mass spectrometric analysis was found to be important, otherwise crystallization of the matrix is disturbed. Mass resolution and the signal-to-noise ration in spectrum 79B are comparable to those of the reference spectrum. These results testify to the specificity of the streptavidin-biotin interaction, since no or only minor signals of the non-biotinylated analyte were detected after incubation with the Dynal beads. Increased suppression of nonspecific binding was reported through an addition of the detergent Tween-20 to the binding buffer (Tong, X. and Smith, L. M. (1992) Anal Chem., 64, 2672-2677). Although this effect could be confirmed in this study, peak broadening affected the quality of the spectra due to remaining amounts of the detergent. The necessity of an elution step as a prerequisite for detection of the captured biotinylated species can be attributed to a stabilizing effect of the complex by the immobilization of the streptavidin to the magnetic beads.

[0649] For practical application of this solid phase method to sequencing a maximum efficiency of binding and elution of biotinylated species is of prime importance. Among a variety of conditions investigated so far, addition of salts such as EDTA gave best results in the case of DNA sequencing by providing ionic strength to the buffer (Tong, X. and Smith, L. M. (1992) Anal Chem.,64, 2672-2677). To examine such an effect on the solid-phase method, several salt additives were tested for the binding and elution of the 5′-biotinylated RNA in vitro transcript (49 nt). The results are shown in FIG. 80. Judging from the relative intensity, signal-to-noise ration, and resolution of the respective signals, a 95% formamide solution containing 10 mM CDTA (FIG. 80D) is most efficient for the binding/elution. Since CDTA acts as a chelating agent for divalent cation, formation of proper secondary an tertiary structure of the RNA is prevented. An improved sensitivity and spectral resolution has been demonstrated under such conditions for the analysis of RNA samples by electrospray mass spectrometry (Limbach, P. A., Crain, P. F. and McCloskey, J. A. (1995) J Am. Soc. Mass. Spectrom., 6, 27-39). The improvement in the MALDI analysis is actually not very significant compared to the spectrum obtained for the solution containing formamide alone (FIG. 81b), but the reproducibility for spectra of good quality was substantially improved for the CDTA/formamide solution. Thus in addition to the improved binding/elution, this additive may also improve the incorporation of the analyte into the matrix crystals. Unfortunately, a striking signal broadening on the high mass side was observed in case of formamide solutions containing EDTA, CDTA or 25% ammonium hydroxide. Since this effect is most prominent in case of 25% ammonium hydroxide and this agent was also used for adjusting EDTA and CDTA to their optimum pH, a pronounced NH₃ adduct ion formation ca be assumed.

[0650] The applicability of streptavidin-coated magnetic beads separation to RNA sequencing was demonstrated for the Rnase U₂ digest of the 5′-biotinylated RNA in vitro transcript (49 nt) (FIG. 81). The entire fragment pattern obtained after incubation with Rnase U₂ is shown is spectrum 81A. Separation of the biotinylated fragments reduces the complexity of the spectrum (FIG. 81 B) since only 5′-terminal fragments are captured by the beads. The signals in the spectrum are broadened and the increased number of signals in the low mass range indicate that even after stringent washing of the beads, some amounts of buffer and detergent used for the binding and elution remained. Further improvements of the method are, therefore, needed. Another possible strategy for application of the magnetic beads is the immobilization of the target RNA prior to RNase digestion by an elution of the remaining fragments for further analysis. Cleavage of the RNA was impeded in this case, as evidenced by a prolonged reaction time for the digest under otherwise identical reaction conditions.

EXAMPLE 22

[0651] Parallel DNA Sequencing Mutation Analysis and Microsatellite Analysis Using Primers with Tags and Mass Spectrometric Detection

[0652] This EXAMPLE describes specific capturing of DNA products generated in DNA analysis. The capturing is mediated by a specific tag (5 to 8 nucleotides long) at the 5′ end of the analysis product that binds to a complementary sequence. The capture sequence can be provided by a partially double stranded oligonucleotide bound to a solid support. Different DNA analysis (e.g., sequencing, mutation, diagnostic, microsatellite analysis) can be carried out in parallel, using, for example, a conventional tube or microtiter plate (MTP). The products are then specifically captured and sorted out via the complementary identification sequence on the tag oligonucleotide. The capture oligonucleotide can be bound onto a solid support (e.g., silicon chip) by a chemical or biological bond. Identification of the sample is provided by the predefined position of the capute oligonucleotide. Purification, conditioning and analysis by mass spectrometry are done on solid support. This method was applied for capturing specific primers that had a 6 base tag sequence.

[0653] MATERIALS AND METHODS

[0654] Genomic DNA.

[0655] Genomic DNA was obtained from healthy individuals.

[0656] PCR Amplification

[0657] PCR amplifications of part of the β-globin gene were established using β2 d(CATTTGCTTCTGACACAACT Seq. ID. No. 66) as forward primer and β11 d(TCTCTGTCTCCACATGCCCAG Seq. ID. No. 67) as reverse primer. The total PCR volume was 50 μl including 200 ng genomic DNA, 1 U Taq-polymerase (Boehringer-Mannheim, Cat# 159594), 1.5 mM MgCl₂, 0.2 mM dNTPs (Boehringer-Mannheim, Cat# 1277049), and 10 pmol of each primer. A specific fragment of the β-globin gene was amplified using the following cycling conditions: 5 min @ 94° C. followed by 40 cycles of 30 sec @ 94° C., 45 sec @ 53° C., 30 sec @ 72° C., and a final extension of 2 min @ 72° C. Purification of the amplified product and removal of unincorporated nucleotides was carried out using the QlAquick purification kit (Qiagen, Cat 28104). One fifth of the purified product was used for the primer oligo base extension (PROBE) or sequencing reactions, respectively.

[0658] Primer oligo base extension (PROBE) and sequencing reactions

[0659] Detection of putative mutations in the human β-globin gene at codon 5 and 6 and at codon 30 and in the IVS-1 donor site, respectively, was done in parallel (FIG. 82A). β-TAG1 (GTCGTCCCATGGTGCACCTGACTC Seq. ID. No. 68) served as primer to analyze codon 5 and 6 and β-TAG2 (CGCTGTGGTGAGGCCCTGGGCA Seq. ID. No. 69) for the analyses of codon 30 and the IVS-1 donor site. The primer oligo base extension (PROBE) reaction was done by cycling, using the following conditions: final reaction volume was 20 μl, β-TAG1 primer (5 pmol), β-TAG2 primer (5 pmol), dCTP, dGTP, dTTP, (final concentration each 25 μM), ddATP (final concentration 100 μM) dNTPs and ddNTPs purchased from Boeringer-Mannheim, Cat# 1277049 and 1008382), 2 μl of 10×ThermoSequence buffer and 2.5 U ThermoSequenase (Amersham, CAT#E79000Y). The cycling program was as follows: 5 min @ 94° C., 30 sec @ 53° C., 30 sec @ 72° C. and a final extension step for 8 min @ 72° C. Sequencing was performed under the same conditions except that the reaction volume was 25 μl and the concentration of nucleotides was 250 μM for ddNTP.

[0660] Capturing using TAG sequence and sample preparation

[0661] The capture oligonucleotides cap-tag1 d(GACGACGACTGCTACCTGACTCCA Seq ID No. 70) and cap-tag2 d(ACAGCGGACTGCTACCTGACTCCA Seq ID No. 71), respectively, were annealed to equimolar amounts of uni-as d(TGGAGTCAGGTAGCAGTC Seq ID No. 72) (FIG. 82A). Each oligonucleotide had a concentration of 10 pmol/μl in ddH₂O and incubated for 2 min @ 80° C. and 5 min @ 37° C. This solution was stored at −20° C. and aliquots were taken. 10 pmol annealed capture oligonucletides were bound to 10 μl paramagnetic beads coated with streptavidin (10 mg/ml; Dynal, Dynabeads M-280 streptavidin Cat# 112.06) by incubation for 30 min @ 37° C. Beads were captured and the PROBE or sequencing reaction, respectively, was added to the capture oligonucleotides. To facilitate binding of β-TAG1 abd β-TAG2, respectively, the reaction was incubated for 5 min @ 25° C. and for 30 min @ 16° C. The beads were washed twice with ice cold 0.7 M NH₄ Citrate to wash away unspecific bound extension products and primers. The bound products were dissolved by adding 1 μl DDH₂O and incubation for 2 min @ 65° C. and cooling on ice. 0.3 μl of the sample were mixed with 0.3 μl matrix solution (saturated 3-hydroxy-picolinic acid, 10% molar ratio ammonium-citrate in acetonitrile/water (50/50. v/v)) and allowed to air dry. The sample target was automatically introduced into the source region of an unmodified Perspective Voyager MALDI-TOF operated in delayed extraction linear mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular mass (M_(r)(calc)) were calculated from atomic compositions; reported experimental M_(r)(M_(r)(exp)) values are those of the singly-pronated form.

[0662] RESULTS

[0663] Specific capturing of a mixture of extension products by a short complementary sequence has been applied to isolate sequencing and primer oligo base extension (PROBE) products. This method was used for the detection of putative mutations in the human β-globin gene at codon 5 and 6 and at codon 30 and IVS-1 donor site, respectively (FIG. 82A). Genomic DNA has been amplified using the primers β2 and β11. The amplification product was purified and the nucleotides separated. One fifth of the purified product was used for analyses by primer oligo base extension. To analyze both sites in a single reaction, primers, β-TAG1 and β-TAG2, were used respectively. β-TAG1 binds upstream of codons 5 and 6 and β-TAG2 upstream of codon 30 and the IVS-1 donor site. Extension of these primers was performed by cycling in the presence of ddATP and dCTP, dGTP and dTTP, leading to specific products, depending on the phenotype of the individual. The reactions were then mixed with the capture oligonucleotides. Capture oligonucleotides include the biotinylated capture primer cap-tag1 and cap-tag2, respectively. They have 6 bases at the 5′ end, that are complementary to the 5′ end of β-TAG1 and β-TAG2, respectively. Therefore, they specifically capture these primers and the extended products. By annealing a universal oligonucleotide (uni-as) to the capture oligonucleotide, the capture primer is transformed into a partially double stranded molecule where only the capture sequence stays single stranded (FIG. 82). This molecule is then bound to streptavidin coated paramagnetic particles, to which the PROBE or sequencing reaction, respectively is added. The mixture was washed to bind only the specifically annealed oligonucleotides. Captured oligonucleotides are dissolved and analyzed by mass spectrometry.

[0664] PROBE products of one individual (FIG. 83) show a small peak with a molecular mass of 7282.8 Da. This corresponds to the unextended β-TAG1 that has a calculated mass of 7287.8 Da. The peak at 8498.6 Da corresponds to a product, that has been extended by 4 bases. This corresponds to the wildtype situation. The calculated mass of this product is 8500.6 Da. There is no significant peak indicating a heterozygote situation. Furthermore only β-TAG1 and not β-TAG2 has been captured, indicating a high specificity of this method.

[0665] Analyses of what was bound to cap-tag2 (FIG. 84) shows only one predominant peak with a molecular mass of 9331.5 Da. This corresponds to an extension of 8 nucleotides. It indicates a homozygous wildtype situation where the calculated mass of the expected product is 9355 Da. There is no significant amount of unextended primer and only β-TAG2 has been captured.

[0666] To prove that this approach is also suitable for capturing specific sequencing products, the same two primers β-TAG1 and β-TAG2, respectively, were used. The primers were mixed, used in one sequencing reaction and then sorted by applying the above explained method. Two different termination reactions using ddATP and ddCTP were performed with these primers (FIGS. 85 and 86, respectively). All observed peaks in the spectrograms correspond to the calculated masses in a wildtype situation.

[0667] As shown above, parallel analysis of different mutations (e.g., different PROBE primers) is now possible. Further, the described method is suitable for capturing specific sequencing products. Capturing can be used for separation of different sequencing primers out of one reaction tube/well, isolation of specific multiplex-amplified products, PROBE products, etc. Conventional methods, like cycle sequencing, and conventional volumes can be used. A universal chip design permits the use of many different applications. Further, this method can be automated for high throughput.

EXAMPLE 23

[0668] Deletion Detection by Mass-Spectrometry

[0669] Various formats can be employed for mass spectrometer detection of a deletion within a gene. For example, molecular mass of a double standard amplified product can be determined, or either or both of the strands of a double stranded product can be isolated and the mass measured as described in previous examples.

[0670] Alternatively, as described herein, a specific enzymatic reaction can be performed and the mass of the corresponding product can be determined by mass spectrometry. The deletion size can be up to several tenths of vases in length, still allowing the simultaneous detection of the wildtype and mutated allele. By simultaneous detection of the specific products, it is possible to identify in a single reaction whether the individual is homozygous or heterozygous for a specific allele or mutation.

[0671] MATERIALS AND METHODS

[0672] Genomic DNA

[0673] Leukocyte genomic DNA was obtained from unrelated healthy individuals.

[0674] PCR amplification

[0675] PCR amplification of the target DNA was established and optimized to use the reaction products without a further purification step for capturing with streptavidin coated beads. The primers for target amplification and for PROBE reactions were as follows:

[0676] CKRΔ-F:d(CAG CTC TCA TTT TCC ATA C SEQ ID. NO. 73) and CKRΔ-R bio: d(AGC CCC AAG ATG ACT ATC SEQ ID. NO. 74). CKR-5 was amplified by the following program: 2 min @ 94° C., 45 seconds @ 52° C., 5 seconds @ 72° C., and a final extension of 5 minutes at 72° C. The final volume was 50 μl including 200 ng genomic DNA 1U Taq-polymerase (Boehringer-Mannheim, Cat # 1596594), 1.5 Mm MgCl₂, 0.2 Mm DNTPS (Boehringer-Mannheim, Cat # 1277049), 10 pmol of unmodified forward primers, and 8 pmol 5′ biotinylated reverse primer.

[0677] Capturing and Denaturation of Biotinylated Templates

[0678] 10 μl paramagnetic beads coated with streptavidin (10 mg/ml; Dynal, Dynabeads M-280 streptavidin Cat # 112.06) in 5×binding solution (5M NH₄Cl, 0.3 M NH₄OH) were added to 45 μl PCR reaction (5 μl of PCR reaction were saved for electrophoresis). After binding by incubation for 30 min. at 37° C. the supernatant was discarded. Captured templates were denatured with 50 μl of 100 Mm NaOH for 5 min. at ambient temperature, washed once with 50 μl 50 Mm NH₄OH and three times with 100 μl 10 Mm Tris/Cl, Ph 8.0. The single stranded DNA served as templates for PROBE reactions.

[0679] Primer Oligo Base Extension (PROBE) Reaction

[0680] The PROBE reaction was performed using Sequence 2.0 (USB Cat # E70775Z including buffer). dATP/DGTP and ddTTP were supplied by Boehringer-Mannheim (Cat # 1277049 and 1008382). d(CAG CTC TCA TTT TCC ATA C (SEQ ID. NO. 73) was used as PROBE primer (FIG. 87). The following solutions were added tot he beads: 3.0 μl H₂O, 1.0 μl reaction buffer, 1.0 μl PROBE primer (10 pmol) and incubated at 65° C. for 5 minutes followed by 37° C. for 10 min. Then 0.5 μl DTT, 3.5 μl DNTPS/ddntp each 50 μM and 0.5,ul Sequenase (0.8 U) were added and incubated at 37° C. for 10 min.

[0681] T4 Treatment of DNA

[0682] To generate blunt ended DNA, amplification products were treated with T4 DNA polymerase (Boehringer-Mannheim Cat# 1004786). The reactions were carried out according to the manufacturer's protocol for 20 min. at 11° C.

[0683] Direct Size Determination of Extended Products

[0684] To determine the size of the amplified product, MALDI-TOF was applied to one strand of the amplification product. samples were bound to beads, as described above, conditioned and denatured, as described below.

[0685] DNA Conditioning

[0686] After the PROBE reaction the supernatant was discarded and the beads were washed first in 50 μl 700 mM NH₄-citrate and second 50 μl 50 mM NH₄-citrate. The generated diagnostic products were removed for the template by heating the beads in 2 μl H₂O at 80° C. for 2 min. The supernatant was used for MALDI-TOF analysis.

[0687] Sample Preparation and Analysis with MALDI-TOF Mass

[0688] Spectrometry

[0689] Sample preparation was performed by mixing 0.6 μl of matrix solution (0.7 M 3-hydroxypicolinic acid, 0.07 M dibasic citrate in 1:1 H₂O:CH₃CN) with 0.3 μl of diagnostic PROBE products in water on a sample target and allowed to air dry. Up to 100 samples were spotted on a probe target disk for introduction into the source region of an unmodified Perspective Voyager MALDI-TOF instrument operated in linear mode with delayed extraction and 5 and 30 kV on the target and conversion dynode, respectively. Theoretical average molecular mass (M_(r)(calc)) of analytes were calculated from atomic compositions, reported experimental M_(r)(M_(r)(exp)) values are those of the singly-pronated form, determined using internal calibration with unextended primers in the case of PROBE reactions.

[0690] Conventional Analyses

[0691] Conventional analyses were performed by native polyacrylamide gel electrophoresis according to standard protocols. The diagnostic products were denatured with formamide prior to loading onto the gels and stained with ethidium bromide or silver, respectively.

[0692] RESULTS

[0693] The CKR-5 status of 10 randomly chosen DNA samples of healthy individuals were analyzed. Leukocyte DNA was amplified by PCR and an aliquot of the amplified product was analyzed by standard polyacrylamide gel electrophoresis and silver staining of the DNA (FIG. 88). Four samples showed two bands presumably indicating heterozygosity for CKR-5, whereas the other 6 samples showed one band, corresponding to a homozygous gene (FIG. 88). In the case where two bands were observed, they correspond to the expected size of 75 bp for the wildtype gene and 43 bp for the allele with the deletion (FIG. 87). Where one band was observed, the size was about 75 bp which indicated a homozygous wildtype CKR-5 allele. One DNA sample derived from a presumably heterozygous one from a homozygous individual were used for all further analysis. To determine the molecular mass of the amplified product, DNA was subjected to matrix assisted laser desorption/ionization coupled with time of flight analysis (MALDI-TOF). Double stranded DNA, bound to streptavidin coated paramagnetic particles, was denatured and the strand released into the supernatant was analyzed. FIG. 89A shows a spectrograph of a DNA sample, that was supposed to be heterozygous according to the result derived by polyacrylamide gel electrophoresis (FIG. 88). The calculated mass of the sense strand for a wildtype gene is 23036 Da and for the sense strand carrying the deletion allele 13143 (FIG. 87 and Table VI). Since many thermostable polymerases unspecifically add an adenosine to the 3′ end of the product, those masses were also calculated. They are 23349 and 13456 Da. The masses of the observed peaks (FIG. 89A) are 23119 Da, which corresponds to the calculated mass of a wildtype DNA strand where an adenosine has been added (23349 Da). Since no peak with a mass of about 23036 Da was observed, the polymerase must have qualitatively added adenosine. Two peaks, which are close to each other, have a mass of 13451 and 13137 Da. This corresponds to the calculated masses of the allele, with the 32bp deletion. The higher mass peak corresponds to the product, where adenosine has been added and the lower mass peak to the one without the unspecific adenosine. Both peaks have about the same height, indicating that to about half of the product adenosine has been added. The peak with a mass of 11682 Da is a doubly charged molecule of the DNA corresponding to 23319 Da (2×11682 Da=23364 Da). The peaks with masses of 6732 and 6575 Da are doubly charged molecules of the one with masses of 13451 and 13137 Da and the peak with 7794 Da corresponds to the triply charged molecule of 23319 Da. Multiple charged molecules are routinely identified by calculation. Amplified DNA derived from a homozygous individual shows in the spectrograph (FIG. 89C) one peak with a mass 23349.6 and a much smaller peak with a mass of 23039.9 Da. The higher mass peak corresponds to DNA resulting from a wildtype allele with an added adenosine, that has a calculated mass of 23349 Da. The lower mass peak corresponds to the same product without adenosine. Three further peaks with a mass of 11686, 7804.6 and 5852.5 Da correspond to doubly, triply and quadruply charged molecules.

[0694] The unspecific added adenine can be removed from the amplified DNA by treatment of the DNA and T4 DNA polymerase. DNA derived from a heterozygous and a homozygous individual was analyzed after T4 DNA polymerase treatment. FIG. 89B shows the spectrograph derived from heterozygous DNA. The peak corresponding to the wildtype strand has a mass of 23008 Da indicating that the added adenine had been removed completely. The same is observed for the strand with a mass of 13140 Da.

[0695] The other three peaks are multiply charged molecules of the parent peaks. The mass spectrograph for the homozygous DNA shows one peak that has a mass of 23004 Da, corresponding to the wildtype DNA strand without an extra adenine added. All other peaks are derived from multiply charged molecules of this DNA. The amplified products can be analyzed by direct determination of their masses, as described above, or by measuring the masses of products, that are derived from the amplified product in a further reaction. In this “primer oligo base extension (PROBE)” reaction, a primer that can be internal, as it is in the nested PCR, or identical to one of the PCR primers, is extended for just a few bases before the termination nucleotide is incorporated. Depending on the extension length, the genotype can be specified. CKRΔ-F was used as a PROBE primer, and dATP/dGTP and ddTTP as nucleotides. The primer extension is AGT in case of a wildtype template and AT in case of the deletion (FIG. 87). The corresponding masses are 6604 Da for the wildtype and 6275 Da for the deletion, respectively. PROBE was applied to two standard DNAs. The spectrograph (FIG. 90A) shows peaks with masses of 6604 Da corresponding to the wildtype DNA and at 6275 Da corresponding to the CKR-5 deletion allele (Table VIII). The peak at a mass of 5673 Da corresponds to CKRA-F (calculated mass of 5674 Da). Further samples were analyzed in analogous way (FIG. 90B). It is unambiguously identified as homozygous DNA, since the peak with a mass of 6607 Da corresponds to the wildtype allele and the peak with a mass of 5677 Da to the unextended primer. No further peaks were observed.

[0696] The example demonstrates that deletion analysis can be performed by mass spectrometry. As shown herein, the deletion can be analyzed by direct detection of single stranded amplified products, or by analysis of specifically generated diagntic products (PROBE). In addition, as shown in the following Example 26, double stranded DNA amplified products can be analyzed. Size Calculated Mass Measured Mass wildtype w/o A 23036 23039/23009/23004 wildtype with A 23349 23319/23350 deletion w/o A 13143 13137/13139 deletion with A 13456 13451 PROBE wildtype 6604 6604/6608 deletion 6275 6275

EXAMPLE 24

[0697] Pentaplex tc-PROBE

[0698] SUMMARY

[0699] The multiplexing of thermocycling primer oligo base extension (tc-PROBE) was performed using five polymorphic sites in three different apolipoprotein genes, which are thought to be involved in the pathogenesis of atherosclerosis. The apolipoprotein A IV gene (codons 347 and 360), the apolipoprotein E gene (codons 112 and 158), and the apolipoprotein B gene (codon 3500) were examined. All mass spectra were easy to interpret with respect to the five polymorphic sites.

[0700] MATERIALS AND METHODS

[0701] PCR Amplification

[0702] Human leukocytic genomic DNA was used for PCR. Listed below are the primers used for the separated amplification of portions of the Apo A IV, Apo E and the Apo B genes: Apo A IV: A347F: 5′-CGA GGA GCT CAA GGC CAG AAT-3′ (SEQ ID NO. 75) A360 R-2-bio: *5′-CAG GGG CAG CTC AGC TCT C-3′ (SEQ ID NO. 76) Apo E: ApoE-F: 5′-GGC ACG GCT GTC CAA GGA-3′ (SEQ ID NO. 77) ApoE-R bio: *5′-AGG CCG CGC TCG GCG CCC TC-3′ (SEQ ID NO. 78) Apo B: ApoB-F2 bio: *5′-CTT ACT TGA ATT CCA AGA GC-3′ (SEQ ID NO. 79) Apo B-R: 5′-GGG CTG ACT TGC ATG GAC CGG A-3′ (SEQ ID NO. 80)

[0703] Taq polymerase and 10×buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs for Pharmacia (Freiburg, Germany). The total PCR reaction volume was 50 μl including 10 pmol of each primer and 10% DMSO (dimethylsulfoxide, Sigma) (no DMSO for the PCR of the Apo B gene), with ˜200 mg of genomic DNA used as template and a final dNTP concentration of 200 μM. Solutions were heated to 80° C. before the addition of 1U Taq polymerase; PCR conditions were: 5 min at 95° C., followed by 2 cycles 30 sec 94° C., 30 sec 62° C., 30 sec 72° C., 2 cycles 30 sec 94° C. 30 sec 58° C., 30 sec 72° C., 35 cycles of 30 sec at 94° C., 30 sec at 56° C., 30 sec at 72° C., and a final extension time of 2 min at 72° C. To remove unincorporated primers and nucleotides, amplified products were purified using the “QlAquick” (Qiagen, Germany )kit, with elution of the purified products in 50 μL of TE buffer (10 mM Tris-HCl, 1 mM EDTA, pH 8.0).

[0704] Binding of the amplified product on beads

[0705] 10 μl of each purified amplified product was bound to 5 μl DynaBeads (Dynal, M-280 Streptavidin) and denatured according to the protocol from Dynal. For the pentaplex tc-PROBE reaction the three different amplified product (bound on the beads) were pooled.

[0706] Tc-PROBE

[0707] For the PROBE reaction the following primers were used: (Apo A) P347: 5′-AGC CAG GAC AAG-3′ (SEQ ID NO. 81) (Apo A) P360: 5′-ACA GCA GGA ACA GCA-3′ (SEQ ID NO. 82) (Apo E) P112: 5′-GCG GAC ATG GAG GAC GTG-3′ (SEQ ID NO. 83) (Apo E) P158: 5′-GAT GCC GAT GAC CTG CAG AAG-3′ (SEQ ID NO. 84) (Apo B) P3500: 5′-GTG CCC TGC AGC TTC ACT GAA GAC-3′ (SEQ ID NO. 85)

[0708] The tc-PROBE was carried out in a final volume of 25 μl containing 10 pmol of each primer listed above, 2.5 U Thermoquenase (Amersham), 2.5 μL Thermoquenase buffer, and 50 μM dTTP (final concentrations) and 200 μM of ddA/C/GTP, respectively. Tubes containing the mixture were placed in a thermocycler and subjected to the following cycling conditions: denaturation (94° C.) the supernatant was carefully removed from the beads and ‘desalted’ by ethanol precipitation to exchange nonvolatile cations such as Na+ and K+ with NH₄+, which evaporated during the ionization process; 5 μL 3M ammonium acetate (pH 6.5) 0.5 μL glycogen (10 mg/mL, Sigma), 25 μL H₂O, and 110 μL absolute ethanol were added to 25 μL PROBE supernatant and incubated for 1 hour at 4° C. After a 10 min. centrifugation at 13,000×g, the pellet was washed in 70% ethanol and resuspended in 1 μL 18 Mohm/cm H₂O. A 0.35 μL aliquot of resuspended DNA was mixed with 0.35 μL matrix solution (0.7 M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H₂O:CH₃CN) on a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition using the Thermo Bioanalysis Version 2000 MALDI-TOF operated in reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively, Theoretical average molecular masses (M₁(calc)) of the fragments were calculated from atomic compositions. External calibration generated from synthetic (ATCG)_(n) oligonucleotide (3.6-18kDa) was used. Positive ion spectra from 1-37500 Da were collected.

[0709] RESULTS

[0710] Table VIII shows the calculated molecular masses of all possible extension products including the mass of the primer itself. FIG. 91 shows a respective MALDI-TOP MS spectra of a tc-PROBE using three different templates and 5 different PROBE primers simultaneously in ne reaction. Comparison of the observed and calculated masses (see table Vil) allows a fast genetic profiling of various polymorphic sites in an individual DNA sample. The sample presented in FIG. 91 is homozygous for threonine and glutamine at position 347 and 360, respectively, in the apolipoprotein A IV gene, bears the epsilon 3 allele homozygous in the apolipoprotein E gene, and is also homozygous at the codon 3500 for arginine in the apolipoprotein B gene. TABLE VIII SEQ ID mass allele Apolipoprotein A IV 5′-AGCCAGGACAAG-3′(347) 86 3688.40 unextended primer 5′-AGCCAGGACAAGTC-3′ 87 4265.80 347Ser 5′-AGCCAGGACAAGA-3′ 88 3985.60 347Thr 5′-ACAGCACCAACAGCA-3′(360) 89 4604.00 unextended primer 5′-ACAGCAGGAACAGCATC-3′ 90 5181.40 360His 5′-ACAGCAGGAACAGCAG-3′(112) 91 4917.20 360Gln Apolipoprotein E 5′-GCGGACATGGAGGACGTG-3′(112) 92 5629.60 unextended primer 5′-GCGGACATGGAGGACGTGGC-3′ 93 6247.00 112Cys 5′-GCGGACATGGAGGACGTGC-3′ 94 5902.80 112Arg 5′-GATGCCGATGACCTGCAGAAG-3′(158) 95 6480.20 unextended primer 5′-GATGCCGATGACCTGCAGAAGC-3′ 96 6753.40 158Arg 5′-GATGCCGATGACCTGCAGAAGTG-3′ 97 7097.60 158Cys Apolipoprotein B-100 5′-GTGCCCTGCAGCTTCACTGAAGAG-3′(3500) 98 7313.80 unextended primer 5′-GTGCCCTGCAGCTTCACTGAAGACTG-3′ 99 7931.20 3500Gln 5′-GTGCCCTGGAGCTTCACTGAAGACC-3′ 100  7587.00 3500Arg

EXAMPLE 25

[0711] Sequencing Exons 5 to 8 of the p53 Gene by MALDI-TOF Mass Spectrometry

[0712] MATERIALS & METHODS

[0713] Thirty-five cycles of PCR reactions were performed in a 96 well microliter plate with each well containing a total volume of 50 μl including 200 ng genomic DNA, 1 unit Taq DNA polymerase, 1.5 mM Mg C1₂, 0.2mM dNTPx, 10 pmol of the forward primer and 6 or 8 of the biotinylated reverse primer. The sequences of PCR primers prepared according to established chemistry (N. D. Sinha, J. Biernat, H. Kter, Tetrahed. Lett. 24:5843-5846 (1983) are as follows: exon 5:d(biotin- TATCTGTTCACTTGTGCCC SEQ ID NO. 101) and d(biotin-CAGAGGCCTGGGGACCCTG SEQ ID NO.102); exon 6: D(ACGACAGGGCTGGTTGCC SEQ ID NO. 103) and d(biotin-ACTGACAACCACCCTTAAC SEQ ID NO. 104); exon 7: d(CTGCTTGCCACAGGTCTC SEQ ID NO. 105) and d(biotin CACAGCAGGCCAGTGTGC SEQ ID NO. 106; exon 8: d(GGACCTGATTTCCTTACTG SEQ ID NO. 107) and d(biotin-TGAATCTGAGGCATAACTG SEQ ID NO. 108).

[0714] To each well of the 96-well microliter plate containing unpurified amplified product, 0.1 mg of paramagnetic streptavidin beads (Dynal) in 10 μl of 5×binding solution (5 M NH₄OH) was added and incubated at 3 7° C. for 30 min.

[0715] Then beads were treated with 0.1 M NaOH at room temperature for 5 min followed by one wash with 50 mM NH₄OH at room temperature for 5 min followed by one wash with 50 mM Tris-HCl.

[0716] Four dideoxy termination reactions were carried out in separate wells of the microliter plate. A total of 84 reactions (21 primers x4 reactions/primer) can be performed in a single microliter plate. To each well containing immobilized single-stranded template, a total volute of 10 μl reaction mixture was added including lx reaction buffer, 10 pmol of sequencing primer, 250 mM of dNTPs, 25 mM of one of the ddNTPs, and 1+182 units of Thermosequenase (Amersham). Sequencing reactions were carried out on a thermal cycler using non-cycling conditions: 80° C., 1 min, 50° C., 1 min, 50° C. to 72° C., ramping 0.1°sec, and 72° C., 5 min. The beads were then washed with 0.7 M ammonium citrate followed by 0.05 M ammonium citrate. Sequencing products were then removed from beads by heating the beads to 80° C. in 2 μl of 50 mM NH₄OH for 2 min. The supernatant was used for MALDI-TOF MS analysis.

[0717] Matrix was prepared as described in Kter, et al (Kter, H. et al, Nature Biotechnol. 14: 1123-1128 (1996)). This saturated matrix solution was then diluted 1.52 times with pure water before use. 0.3 μl of the diluted matrix solution was then diluted 1.52 times with pure water before use. 0.3 μl of the diluted matrix solution was loaded onto the sample target and allowed to crystallize followed by addition of 0.3 μl of the aqueous analyte. A Perseptive Voyager DE mass spectrometer was used for the experiments, and the samples were typically analyzed in the manual mode. The target and middle plate were kept at +18.2 kV for 200 nanoseconds after each laser shot and then the garget voltage was raised to +20 kV. the ion guide wire in the flight tube was kept at −2V. Normally, 250 laser shots were accumulated foe each sample. The original spectrum was acquired under 500 MHz digitizing rate, and the final spectrum was smoothed by a 455 point average (Savitsky and Golay, (1964) Analytical Chemistry, 36:1627). Default calibration of the mass spectrometer was used to identify each peak and assign sequences. The theoretical mass values of two sequencing peaks were used to recalibrate each spectrum. (D. P. Little, T. J. Cornish, M. J. O'Donnel, A. Braun, R. J. Cotter, H. Kter, Anal. Chem., submitted).

[0718] RESULTS

[0719] Alterations of the p53 gene are considered to be a critical step in the development of many human cancers (Greenblatt, et al., (1994) Cancer Res. 54, 4855-4878; C. C. Harris, (1996) J. Cancer, 73, 261-269; and D. Sidransky and M. Hollstein, (1996) Annu. Res. Med., 47,285-301). Mutations may serve as molecular indicators of clonality or as early markers of relapse in a patient with a previously identified mutation in a primary tumor (Hainaut, et al., (1997) Nucleic Acid Res., 25, 151-157). The prognosis of the cancer may differ according to the nature of the p53 mutations present (H.S. Goh et al., (1995) Cancer Res, 55, 5217-5221). Since the discovery of the p53 gene, more than 6000 different mutations have been detected. Exons 5-8 were selected as sequencing targets where most of the mutations cluster (Hainaut et al. (1997) Nucleic Acids Res., 25, 151-7).

[0720]FIG. 96 schematically depicts the single tube process for target amplification and sequencing, which was performed, as described in detail in the Materials and Methods. Each of exon 5-8 of the p53 gene was PCR amplified using flanking primers in the intron region; the down stream primer was biotinylated. Amplifications of different exons were optimized to use the same cycling profile, and the products were used without further purification. PCR reactions were performed in a 96 well microliter plate and the product generated in one well was used as the template for one sequencing reaction. Streptavidin-coated magnetic beads were added to the same microliter plate and amplified products were immobilized. The beads were then treated with NaOH to generate immobilized single-stranded DNA as sequencing template. The beads were washed extensively with Tris buffer since remaining base would reduce the activity of sequencing enzyme.

[0721] A total of 21 primers were selected to sequence exon 5-8 of the p53 gene by primer walking. The 3′-end nucleotide of all the primers is located at the site where no known mutation exists. Four termination reactions were performed separately which resulted in a total of 84 sequencing reactions on the same PCR microliter plate. Non-cycling conditions were adopted for sequencing since streptavidin coated beads do not tolerate the repeated application of high temperature. Sequencing reactions were designed so that mt terminated fragments were under 70 nucleotides, a size range easily accessible by MALDI-TOF MS and yet long enough to sequence through the next primer binding site. Thermequenase was the enzyme of choice since it could reproducible generate a high yield of sequencing products in the desired mass range. After the sequencing reactions, the beads were washed with ammonium ion buffers to replace all other cations. The sequencing ladders were then removed from the beads by heating in ammonium hydroxide solution or simply in water.

[0722] A sub-microliter aliquot of each of the 84 sequencing reactions was loaded onto one MS sample holder containing preloaded matrix. FIG. 94 gives an example of sequencing data generated from one primer; four spectra are superimposed.

[0723] All sequencing peaks were well resolved in the mass range needed to read through the next sequencing primer site. Sometimes doubly charged peaks were observed which could be easily identified by correlating the mass to that of the singly charged ion. False stops generated by early termination of the enzymatic extension can be observed cle to the primer site. Since the mass resolution is high enough, it is easy to differentiate the false stop peaks from the real sequencing peaks by calculating the mass difference of the neighboring peaks and crs comparing the four spectra. Additionally, mt primers generated detectable data through the region of the downstream primer binding site thereby covering the false stop region.

[0724] Using optimized procedures of amplification, sequencing, and conditioning, exons 5-8 of the p53 gene were successfully sequenced. Correct wildtype sequence data were obtained from all exons with a mass resolution about 300 to 800 over the entire mass range. The overall mass accuracy is 0.05% or better. The average amount of each sequencing fragment loaded on the MS sample holder is estimated to be 50 fmol or less.

[0725] This example demonstrates the feasibility of sequencing exons of a human gene by MALDI-TOF MS. Compare to gel-based automated fluorescent DNA sequencing, the read lengths are shorter. Microchip technology can be incorporated to provide for parallel processing. Sequencing products generated in the microtiter plate can be directly transferred to a microchip which serves as a launching pad for MALDI-TOF MS analysis. Robot-driven serial and parallel nanoliter dispensing tools are being used to produce 100-1000 element DNA arrays on <1″ square chips with flat or geometrically altered (e.g., with wells) surfaces for rapid mass spectrometric analysis.

[0726]FIG. 94 shows an MS spectrum obtained on a chip where the sample was transferred from a microtiter plate by a pintool. The estimated amount of each termination product loaded is 5 fmol or less which is in the range of amounts used in conventional Sanger sequencing with radiolabeled or fluorescent detection (0.5-1 fmol per fragment). The low volume MALDI sample deposition has the advantages of miniaturization (reduced reagent cts), enhanced reproducibility and automated signal acquisition.

EXAMPLE 26

[0727] Direct detection of synthetic and biologically generated double-stranded DNA by MALDI-TOF MS

[0728] Introduction

[0729] Typically, matrix-associated laser desorption/ionization (Karas, et. al., (1989) Int. J. Mass Spectrom, Ion Processes, 92, 231) time-of-flight mass spectrometry (MALDI-TOF MS) of DNA molecules which are double stranded (ds) in solution yields molecular ions representative of the two single stranded components (Tang, etal. (1994) Rapid Commun. Mass Spectrom. 8:183; Tang, etal. (1995) Nucleic Acids Res. 23:3126; Benner, etal. (1995) Rapid Commun. Mass Spectrom. 9:537; Liu, etal. (1995) Anal. Chem. 67:3482; Siegert et al. (1996) Anal. Biochem. 243:55; and Doktycz, etal. (1995) Anal. Biochem. 230:205); this has been observed in several reports dealing with biologically generated DNA from a polymerase chain reaction (PCR) amplification (Tang, et al. (1994) Rapid Commun. Mass Spectrom. 8:183; Liu, etal. (1995) Anal. Chem. 67:3482; Siegert etal. (1996) Anal. Biochem. 243:55; and Doktycz, et al. (1995) Anal. Biochem. 230:205). It is not clear whether the double strand is destabilized because of the decreased pH in the matrix environment or because of absorbance by the duplex during desorption/ionization/acceleration of an energy sufficient to overcome the attractive van der Waals and “stacking” stabilization forces (Cantor and Shimmel, Biophysical Chemistry Part I: The conformation of Biomolecules, W. H. Freeman, New York, (1980), 176). When analyte is present at high concentrations formation of non-specific gas-phase DNA multimers is, as with proteins (Karas, et. al., (1989) Int. J. Mass Spectrom, Ion Processes 92:231), common; however, Lecchi and Pannell (Lecchi et al (1995) J. Am. Soc. Mass Spectrom. 6:972) have provided strong evidence for specific Watson Crick (WC) base pairing being maintained in the gas phase. They detected these specific dimers when using 6-aza-2-thiothymine as a matrix, but did not observe them with 3-hydroxypicclinic acid (3-HPA) or 2,4,6-hydroxyacetophenone matrix. As described below, by using a low acceleration voltage of the ions and preparing samples for MALDI analysis at reduced temperatures, routine detection of dsDNA is possible.

[0730] MATERIALS AND METHODS

[0731] Synthetic DNA. Oligonucleotides were synthesized (Sinha, et al. (1984) Nucleic Acids Res., 12, 4539) on a Perspective Expedite DNA synthesizer and reverse phase HPLC purified in-house. Sequences were: 50-mer (15337 Da): 5′-TTG CGT ACA CAC TGG CCG TCG TTT TAC AAC GTC GTG ACT GGG AAA ACC CT-3′ (SEQ ID NO. 109); 27-mer_(c) (complementary, 8343 Da): 5′-GTA AAA CGA CGG CCA GTG TGT ACG CAA-3′ (SEQ ID NO. 110); 27-mer_(nc) (non-complementary, 8293 Da): 5′-TAC TGG AAG GCG ATC TCA GCA ATC AGC-3′ (SEQ ID NO. 111). 100 μM stock solutions were diluted to 20, 10, 5, and 2.5 μM using 18 Mohm/cm H₂O. 2 μL each of equimolar solutions of the 50-mer and either 27-mer_(c) or 27-mer_(nc) were mixed and allowed to anneal at room temperature for 10 minutes. 0.5 μL of these mixtures were mixed directly on a sample target with 1 μL matrix (0.7 M 3-HPA, 0.07 M ammonium citrate in 50% acetonitrile) and allowed to air dry.

[0732] Biological DNA. Enzymatic digestion of human genomic DNA from leukocytes was performed. PCR primers (forward, 5′-GGC ACG GCT GTC CAA GGA G-3′ (SEQ ID NO. 112)); reverse, 5′-AGG CCG CGC TCG GCG CCC TC-3′ (SEQ ID NO. 113) to amplify a portion of exon 4 of the apolipoprotein E gene were delineated from the published sequence (Das et al., (1985) J. Biol. Chem., 260 6240). Taq polymerase and 10×buffer were purchased from Boehringer-Mannheim (Germany) and dNTPs from Pharmacia (Freiburg, Germany). The total reaction volume was 50 μl including 20 pmol of each primer and 10% DMSO (dimethylsulfoxide, Sigma) with approximately 200 ng of genomic DNA used as template. Solutions were heated to 80° C. before the addition of IU polymerase; PCR conditions were: 2 min at 94° C., followed by 40 cycles of 30 sec at 94° C., 45 sec at 63° C., 30 sec at 72° C., and a final extension time of 2 min at 72° C. While no quantitative data was collected to determine the final yield of amplified product, it is estimated that -2pmol were available for the enzymatic digestion.

[0733] Cfol and Rsal and reaction buffer L were purchased from Boehringer-Mannheim. 20 μl of amplified products were diluted with 15 μl water and 4 μl buffer L; after addition of 10 units of restriction enzymes the samples were incubated for 60 min at 37° C. For precipitation of digest products 5 μl of 3M ammonium acetate (pH 6.5), (5 μl glycogen (Braun, etal. (1997) Clin. Chem. 43:1151) (1Omg/ml, Sigma), and 110 μl absolute ethanol were added to 50 μL of the analyte solutions and stored for 1 hour at room temperature. After at 10 min centrifugation at 13,000×g, the pellet was washed in 70% ethanol and resuspended in 1 μl 18 Mohm/cm H₂O.

[0734] Sample preparation and analysis by MALDI-TOF MS. 0.35 μl of resuspended DNA was mixed with 0.35-1.3 μL matrix solution (0.7M 3-hydroxypicolinic acid (3-HPA), 0.07 M ammonium citrate in 1:1 H₂O:CH₃CN) (Wu, et al. (1993) Rapid Commun. Mass Spectrom. 7:142) on a stainless steel sample target disk and allowed to air dry preceding spectrum acquisition using a Thermo Bioanalysis Vision 2000 MALDI-TOF instrument operated in pitive ion reflectron mode with 5 and 20 kV on the target and conversion dynode, respectively. Theoretical average molecular masses (M_(r)(calc)) of the fragments were calculated from atomic compositions; the mass of a proton (1.08 Da) was subtracted from raw data values in reporting experimental molecular masses (M_(r)(exp)) as neutral basis. External calibration generated from eight peaks (2000-1 8000 Da) was used for all spectra.

[0735] Results and Discussion

[0736]FIG. 96A is a MALDI-TOF mass spectrum of a mixture of the synthetic 50-mer with (non-complementary) 27-mer_(nc) (each 10 μM, the highest final concentration used in this study); the laser power was adjusted to just above the threshold irradiation for ionization. The peaks at 8.30 and 15.34 kDa represent singly charged ions derived from the 27- and 50-mer single strands, respectively. Poorly resolved low intensity signals at −16.6 and −30.7 kDa represent homodimers of 27-and 50-mer, respectively; that at 23.6 kDa is consistent with a heterodimer containing one 27-mer and one 50-mer strand. Thus low intensity dimer ions representing all possible combinations from the two non-complementary oligonucleotides (27+27; 27+50; 50+50) were observed. Increasing the irradiance even to a point where depurination peaks dominated the spectrum resulted in slightly higher intensities of these dimer peaks. Note that the hybridization was performed at room temperature and with a very low salt concentration, conditions at which non-specific hybridization may occur.

[0737]FIG. 96 shows a MALDI-TOF spectrum of the same 50-mer mixed with (complementary) 27-mer_(c); the final concentration of each oligonucleotide was again 10 μM. Using the same laser power as in FIG. 96A, intense signals were again observed at 88.34 and 15.34 Kda, consistent with single stranded 27- and 50-mer, respectively. Homodimer peaks (27+27; 50+50) were barely apparent in the noise; however, singly (23.68 Kda) and doubly (11.84k Da) charged heterodimer (27+50) peaks were dominant. Although the 23.68 Kda dimer peak could be detected from all irradiated positions, its intensity relative to the monomer peaks varied slightly from spot-to-spot. Repeating the experiment with individual oligonucleotide concentrations of 5, 2.5, and 1.25 μM resulted in decreasing amounts of the 27-/50-mer Watson-Crick dimer peak relative to the 27- and 50-mer single stranded peaks. At the lowest concentrations, the observation of dimer was “crystal-dependent”, that is, irradiation of some crystals produced significant 27-/50-mer dimer signal, while other crystals reproducibly yielded very little or none. This indicates that the incorporation of dsdna into the matrix crystals or the effectiveness of retaining this interaction through the ionization/desorption process is dependent upon the microscopic properties of the crystals, and/or that there exist steep concentration gradients of the duplex throughout the sample.

[0738] Thus the FIG. 96 spectra provide strong evidence that specific WC base paired dsdna can be observed using gentle laser conditions with high concentrations of oligonucleotides in this mass range, the first report of this using a 3-HPA matrix. The study was extended to a complex mixture of dsdna derived from an enzymatic digest (Rsal/Cfol) of a region of exon 4 of the apolipoprotein E gene (Das et. al., (1985) J. BioL Chem., 260 6240); expected fragment masses are given in Table IX. TABLE IX CfoI/RsaI Digestion Products from ApoE gene exon 4² bases^(b) ssDNA (Da) dsdna (Da) (+) (−) (+) (−) A 11 13 3428 4025 7453 16 5004 4924 9928 18 5412 5750 11162 17 19 5283 5880 11163 19 5999 5781 11780 24 22 7510 6745 14225 31 29 9628 9185 18813 36 38 11279 11627 22906 48 14845 14858 29703 55 53 17175 16240 33415

[0739] After the digestion step, the samples were purified and concentrated by ethanol precipitation and resuspended in 1 μL H₂O before mixing them at room temperature with matrix on the sample target. Nearly 20 peaks ranging in mass from 3.4-17.2 Kda were resolved in the products' MALDI spectrum (FIG. 97A), all consistent with denatured single stranded components of the double strand (Table IX). Many such analyses of similar biological products over a period of months also yielded spectra with negligible dsdna, consistent with previous reports (Tang, etal. (1994) Rapid Commun. Mass Spectrom. 8:183; Liu, et al. (1995) Anal. Chem. 67:3482; Siegert et al. (1996) Anal. Biochem. 243:55; and Doktycz, etal. (1995) Anal. Biochem. 230:205); contrarily, intact double strands were observed under similar conditions for the synthetic DNA (FIG. 96A). It is difficult to estimate the strand concentration available after the biological reactions, but presumably that it was far lower than that at which dimerization of synthetic samples occurred. Furthermore, maintaining specific hybrids within the two-component synthetic mixture may be kinetically favored relative to the far more complex mixture of 20 single-stranded DNA components from the digest.

[0740] The effect of reduced temperature on maintaining dsDNA was tested. An aliquot of the digested DNA solution, the matrix, pipette, pipette tips, and the stainless steel sample target were stored in a 4° C. “cold room” for 15 minutes; as with normal preparations matrix, and then analyte, were spotted on the target and allowed to co-crystallize while air drying. Crystallization for mixtures of 300 nL 3HPA (50% acetonitrile) with 300 nL analyte required ˜1 minute at room temperature but ˜15 minutes at the reduced temperature. Sample spots prepared in the cold room environment typically contained a high proportion of large transparent crystals.

[0741] MALDI-TOF analysis of an ApoE digest aliquot prepared at reduced temperature produced the FIG. 97B spectrum. While the low mass range appeared qualitatively similar to FIG. 97A, dramatic differences above 8 kDa were observed. Only signals consistent with single strands (Table IX) were observed in FIG. 97A, but the FIG. 97B cold room prepared samples did not yield signals for the same masses except below 8 kDa. Even more striking were the additional high mass peaks in FIG. 97B; clearly these represent dimer peaks containing lower mass components. As was done with the synthetic DNA, it was important to determine whether these represent non-specific heterodimers, specific WC heterodimers, or nonspecific homodimers. Consider first the 33.35 kDa fragment. Ignoring the unlikely possibility that the high mass fragment represents a trimer or higher multimer, as a dimer it must only contain the highest mass ssDNA components, i.e., the >16 kDa. Homodimerization of the 15.24 and 17.18 kDa fragments would result in 32.49 and 34.35 kDa peaks, respectively; corresponding mass errors for these incorrect assignments relative to the observed 33.35 kDa would be −2.6% and +3.0% respectively. A far better match is achieved if this peak originates from a heterodimer of the two highest mass single stranded fragments; their summed mass (16.24+17.18=33.42 kDa) differed by 0.2% from the observed dimer mass 33.35 kDa, an acceptable mass error for MALDI-TOF analysis of large DNA fragments using external calibration. Likewise, the 29.66kDa fragment was measured only 0.13% lower than the 29.70 Da expected for a heterodimer of 48-mers; the sum of no other possible homodimers or heterodimers were within a reasonable range of this mass. Similar arguments could be made for the 22.89 and 18.83 kDa fragments, representing 36-/38-mer and 31-/29-heterodimers, respectively; the signal at 14.86 kDa is consistent with singly charged single stranded and doubly charged double-stranded 48-mer. The agreement of the FIG. 97B masses above 15 kDa with the of dsDNA expected from this digest and the absence of homodimers and non-specific heterodimers at random masses indicated that the base pairings were indeed highly specific and provided further evidence that gas-phase WC interactions may be retained in MALDI-generated ions.

[0742]FIG. 98 shows a MALDI-TOF spectrum of an ε4 allele, which, unlike the ε3, was expected to yield no 36-/38-mer pair upon Cfol/Rsal digestion. The ε3 and ε4 mass spectra were similar except that abundant 22.89 kDa fragment in FIG. 97B was not present in FIG. 98; with this information alone (Table IX) ε3 and ε4 alleles were easily distinguished, thereby demonstrating the genotyping by direct measurement of dsDNA by MALDI-TOF MS. Similarly dsDNA could be ionized, transferred to the gas phase, and detected by MALDI-TOF MS. The acceleration voltage typically employed on our instrument was only −5kV corresponding to 1.5 kV/mm up to −2 mm from the sample target, with the electric field strength decreasing rapidly with distance from the sample target. Most previous work used at least 20 kV acceleration (Lecchi et al. (1995) J. Am. Soc. Mass Spectrom. 6:972); in one exception a 27-mer dsDNA was detected using a frozen matrix solution and 100 V acceleration (Nelson, etal. (1990) Rapid Commun. Mass Spectrom. 4:348). Without being bound by any theory MALDI-induced “denaturation” of dsDNA may be due to gas-phase collisional activation that disrupts the WC pairing when high acceleration fields are employed, analogous to the denaturation presumed to be a first step in the fragmentation used for sequencing the single stranded components of dsDNA using electrospray ionization (McLafferty et al. (1996) Int. J. Mass Spectrom., Ion Processes). It appears that the high salt concentrations (typically >1 OmM NaCl or KCl) required to stabilize WC paired dsDNA in solution are unsuitable for MALDI analysis (Nordhoff et al. (1 993) Nucleic Acids Res. 21:3347); reducing the concentration of such non-volatile cations is necessary to avoid cation-adducted MALDI signals, but destabilizes the double strands in solution. The low pH conditions of the matrix environment should also destabilize the duplex. As shown in FIGS. 97B and 98, storing and preparing even low concentrations of the biological samples at reduced temperature at least in part offset these denaturing effects, especially for longer strands where melting temperatures are higher due to a more extensive hydrogen bonding network. The conditions used here are recognized to be very non-stringent annealing conditions.

[0743] The low mass tails on high mass dsDNA peaks (e.g., FIG. 97B, 232 kDa) are consistent with depurination generated to a higher extent than the sum of depurination from each of the single strands combined. Although depurination in solution is an acid-catalyzed reaction, the weakly acidic conditions in the 3-HPA matrix do not induce significant depurination; molecular ion signals from a mixed-base 50-mer measured with De-MALDI-TOF had only minor contributions from depurination peaks (Juhaz, et al. (1996) Anal. Chem. 68:941). Depurination from the single stranded components of the gas-phase dsDNA is observed even though these bases are expected to be hydrogen bonded to the complementary base of the accompanying strand, implying that covalent bonds are being broken before the strand is denatured.

EXAMPLE 27

[0744] Efficiency and Specificity Assay for Base-Specific Ribonucleases

[0745] Aliquots sampled at regular time intervals during digestion of selected synthetic 20 to 25 mers were analyzed by mass spectrometry. Three of the RNAses were found to be efficient and specific. These include: the G-specific T₁, the A-specific U₂ and the A/U-specific PhyM. The ribonucleases presumed to be C-specific were found to be less reliable, e.g., did not cleave at every C or also cleaved at U in an unpredictable manner. The three promising RNAses all yielded cleavage at all of the predicted positions and a complete sequence coverage was obtained. In addition, the presence of cleavage products containing one or several uncleaved positions (short incubation times), allowed alignment of the cleavage products. An example of the MALDI-spectrum of an aliquot sampled after T₁ digest of a synthetic 20-mer [SEQ ID NO:114] RNA is shown in FIG. 100.

EXAMPLE 28

[0746] Immobilization of amplified DNA targets to silicon wafers

[0747] Silicon surface preparation

[0748] Silicon wafers were washed with ethanol, flamed over bunsen burner, and immersed in an anhydrous solution of 25% (by volume) 3-aminopropyltriethoxysilane in toluene for 3 hours. The silane solution was then removed, and the wafers were washed three times with toluene and three times with dimethyl sulfoxide (DMSO). The wafers were then incubated in a 10 mM anhydrous solution of N-succinimidyl (4-iodoacetyl) aminobenzoate (SIAB) (Pierce Chemical, Rockford, Ill.) in anhydrous DMSO. Following the reaction, the SIAB solution was removed, and the wafers were washed three times with DMSO. In all cases, the iodoacetamido-functionalized wafers were used immediately to minimize hydrolysis of the labile iodoacetamido-functionality. Additionally, all further wafer manipulations were performed in the dark since the iodoacetamido-functionality is light sensitive.

[0749] Immobilization of amplified thiol-containing nucleic acids

[0750] The SlAB-conjugated silicon wafers were used to analyze specific free thiol-containing DNA fragments of a particular amplified DNA target sequence. A 23-mer oligodeoxynucleotide containing a 5′-disulfide linkage [purchased from Operon Technologies; SEQ ID NO: 11 7] that is complementary to the 3′-region of a 112 bp human genomic DNA template [Genebank Acc. No.: Z52259; SEQ ID NO: 118] was used as a primer in conjunction with a commercially available 49-mer primer, which is complementary to a portion of the 5′-end of the genomic DNA [purchased from Operon Technologies; SEQ ID NO: 119], in PCR reactions to amplify a 135 bp DNA product containing a 5′-disulfide linkage attached to only one strand of the DNA duplex [SEQ ID NO: 120].

[0751] The PCR amplification reactions were performed using the Amplitaq GoldKit [Perkin Elmer Catalog No. N808-0249]. Briefly, 200 ng 112 bp human genomic DNA template was incubated with 10 μM of 23-mer primer and 8 μM of commercially available 49-mer primer, 10 mM dNTPs, 1 unit of Amplitaq Gold DNA polymerase in the buffer provided by the manufacturer and PCR was performed in a thermocycler.

[0752] The 5′-disulfide bond of the resulting amplified product was fully reduced using 10 mM tris-(2-carboxyethyl) phosphine (TCEP) (Pierce Chemical, Rockford, Ill.) to generate a free 5′-thiol group. Disulfide reduction of the modified oligonucleotide was monitored by observing a shift in retention time on reverse-phase FPLC. It was determined that after five hours in the presence of 10 mM TCEP, the disulfide was fully reduced to a free thiol. Immediately following disulfide cleavage, the modified oligonucleotide was incubated with the iodacetamido-functionalized wafers and conjugated to the surface of the silicon wafer through the SIAB linker. To ensure complete thiol deprotonation, the coupling reaction was performed at pH 8.0. Using 10 mM TCEP to cleave the disulfide and the other reaction conditions described above, it was possible to reproducibly yield a surface density of 250 fmol per square mm of surface.

[0753] Hybridization and MALDI-TOF Mass spectrometry

[0754] The silicon wafer conjugated with the 1 35 bp thiol-containing DNA was incubated with a complementary 12-mer oligonucleotide [SEQ ID NO: 121] and specifically hybridized DNA fragments were detected using MALDI-TOF MS analysis. The mass spectrum revealed a signal with an observed experimental mass-to-charge ratio of 3618.33; the theoretical mass-to-charge ratio of the 12-mer oligomer sequence is 3622.4 Da.

[0755] Thus, specific DNA target molecule that contain a 5′-disulfide linkage can be amplified. The molecules are immobilized at a high density on a SIAB-derivatized silicon wafer using the methods described herein and specific complementary oligonucleotides may be hybridized to these target molecules and detected using MALDI-TOF MS analysis.

EXAMPLE 29

[0756] Use of High Density Nucleic Acid Immobilization to Generate Nucleic Acid Arrays

[0757] Employing the high density attachment procedure described in EXAMPLE 28, an array of DNA oligomers amenable to MALDI-TOF mass spectrometry analysis was created on a silicon wafer having a plurality of locations, e.g., depressions or patches, on its surface. To generate the array, a free thiol-containing oligonucleotide primer was immobilized only at the selected locations of the wafer [e.g., see EXAMPLE 28]. The each location of the array contained one of three different oligomers. To demonstrate that the different immobilized oligomers could be separately detected and distinguished, three distinct oligonucleotides of differing lengths that are complementary to one of the three oligomers were hybridized to the array on the wafer and analyzed by MALDI-TOF mass spectrometry.

[0758] Oligodeoxynucleotides

[0759] Three sets of complementary oligodeoxynucleotide pairs were synthesized in which one member of the complementary oligonucleotide pair contains a 3′- or 5′-disulfide linkage [purchased from Operon Technologies or Oligos, Etc.]. For example, Oligomer 1 [d(CTGATGCGTCGGATCATCTTTTTT-SS); SEQ ID NO: 122] contains a 3′-disulfide linkage whereas Oligomer 2 [d(SS-CCTCTTGGGAACTGTGTAGTATT); a 5′-disulfide derivative of SEQ ID NO: 117] and Oligomer 3 [d(SS-GAATTCGAGCTCGGTACCCGG); a 5′-disulfide derivative of SEQ ID NO: 115] each contain a 5′-disulfide linkage.

[0760] The oligonucleotides complementary to Oligomers 1-3 were designed to be of different lengths that are easily resolvable from one another during MALDI-TOF MS analysis. For example, a 23-mer oligonucleotide [SEQ ID NO: 123] was synthesized complementary to a portion of Oligomer 1, a 12-mer oligonucleotide [SEQ ID NO: 121] was synthesized complementary to a portion of Oligomer 2 and a 21-mer [SEQ ID NO: 116] was synthesized complementary to a portion of Oligomer 3. In addition, a fourth 29-mer oligonucleotide [SEQ ID NO: 124] was synthesized that lacks complementarity to any of the three oligomers. This fourth oligonucleotide was used as a negative control.

[0761] Silicon surface chemistry and DNA immobilization

[0762] (a) 4×4 (16-location) array

[0763] A 2×2 cm² silicon wafer having 256 individual depressions or wells in the form of a 16×16 well array was purchased from a commercial supplier [Accelerator Technology Corp., College Station, Tex.]. The wells were 800×800μm², 120 μm deep, on a 1.125 pitch. The silicon wafer was reacted with 3-aminopropyltriethoxysilane to produce a uniform layer of primary amines on the surface and then exposed to the heterobifunctional crosslinker SIAB resulting in iodoacetamido functionalities on the surface [e.g., see EXAMPLE 28].

[0764] To prepare the oligomers for coupling to the various locations of the silicon array, the disulfide bond of each oligomer was fully reduced using 10 mM TCEP as depicted in EXAMPLE 28, and the DNA resuspended at a final concentration of 10 μM in a solution of 100 mM phosphate buffer, pH 8.0. Immediately following disulfide bond reduction, the free-thiol group of the oligomer was coupled to the iodoacetamido functionality at 1 6 locations on the wafer using the probe coupling conditions essentially as described above in EXAMPLE 28. To accomplish the separate coupling at 16 distinct locations of the wafer, the entire surface of the wafer was not flushed with an oligonucleotide solution but, instead, an ˜30-nl aliquot of a predetermined modified oligomer was added in parallel to each of 16 locations (i.e., depressions) of the 256 wells on the wafer to create a 4×4 array of immobilized DNA using a robotic pintool.

[0765] The robotic pintool consists of 16 probes housed in a probe block and mounted on an X Y, Z robotic stage. The robotic stage was a gantry system which enables the placement of sample trays below the arms of the robot. The gantry unit itself is composed of X and Y arms which move 250 and 400 mm, respectively, guided by brushless linear servo motors with positional feedback provided by linear optical encoders. A lead screw driven Z axis (50 mm vertical travel) is mounted to the xy axis slide of the gantry unit and is controlled by an in-line rotary servo motor with positional feedback by a motor-mounted rotary optical encoder. The work area of the system is equipped with a slide-out tooling plate that holds five microtiter plates (most often, 2 plates of wash solution and 3 plates of sample for a maximum of 11 52 different oligonucleotide solutions) and up to ten 20×20 mm wafers. The wafers are placed precisely in the plate against two banking pins and held secure by vacuum. The entire system is enclosed in plexi-glass housing for safety and mounted onto a steel support frame for thermal and vibrational damping. Motion control is accomplished by employing a commercial motion controller which was a 3-axis servo controller and is integrated to a computer; programming code for specific applications is written as needed.

[0766] To create the DNA array, a pintool with assemblies that have solid pin elements was dipped into 16 wells of a multi-well DNA source plate containing solutions of Oligomers 1-3 to wet the distal ends of the pins, the robotic assembly moves the pin assembly to the silicon wafer, and the sample spotted by surface contact. Thus, one of modified Oligomers 1-3 was covalently immobilized to each of 16 separate wells of the 256 wells on the silicon wafer thereby creating a 4×4 array of immobilized DNA.

[0767] In carrying out the hybridization reaction, the three complementary oligonucleotides and the negative control oligonucleotide were mixed at a final concentration of 10 μM for each oligonucleotide in 1 ml of TE buffer [10 mM Tris-HCl, pH 8.0, 1 mM EDTA] supplemented with 1 M NaCl, and the solution was heated at 65° C. for 10 min. Immediately thereafter, the entire surface of the silicon wafer was flushed with 800 μl of the heated oligonucleotide solution. The complementary oligonucleotides were annealed to the immobilized oligomers by incubating the silicon array at ambient temperature for 1 hr, followed by incubation at 4° C. for at least 10 min. Alternatively, the oligonucleotide solution can be added to the wafer which is then heated and allowed to cool for hybridization.

[0768] The hybridized array was then washed with a solution of 50 mM ammonium citrate buffer for cation exchange to remove sodium and potassium ions on the DNA backbone (Pieles etal., (1993) Nucl. Acids Res. 21:31 91-3196). A 6-ni aliquot of a matrix solution of 3-hydroxypicolinic acid [0.7 M 3-hydroxypicolinic acid-10 % ammonium citrate in 50 % acetonitrile; see Wu et al. Rapid Commun. Mass Spectrom. 7:142-146 (1993)] was added in series to each location of the array using a robotic piezoelectric serial dispenser (i.e., a piezoelectric pipette system).

[0769] The piezoelectric pipette system is built on a system purchased from Microdrop GmbH, Norderstedt Germany and contains a piezoelectric element driver which sends a pulsed signal to a piezoelectric element bonded to and surrounding a glass capillary which holds the solution to be dispensed; a pressure transducer to load (by negative pressure) or empty (by positive pressure) the capillary; a robotic xyz stage and robot driver to maneuver the capillary for loading, unloading, dispensing, and cleaning, a stroboscope and driver pulsed at the frequency of the piezo element to enable viewing of ‘suspended’ droplet characteristics; separate stages for source and designation plates or sample targets (i.e. Si chip); a camera mounted to the robotic arm to view loading to designation plate; and a data station which controls the pressure unit, xyz robot, and piezoelectric driver.

[0770] The 3-HPA solution was allowed to dry at ambient temperature and thereafter a 6-nl aliquot of water was added to each location using the piezoelectric pipette to resuspend the dried matrix-DNA complex, such that upon drying at ambient temperature the matrix-DNA complex forms a uniform crystalline surface on the bottom surface of each location.

[0771] MALDI-TOF MS analysis

[0772] The MALDI-TOF MS analysis was performed in series on each of the 16 locations of the hybridization array illustrated in FIG. 6 essentially as described in EXAMPLE 28. The resulting mass spectrum of oligonucleotides that specifically hybridized to each of the 16 locations of the DNA hybridization revealed a specific signal at each location representative of observed experimental mass-to-charge ratio corresponding to the specific complementary nucleotide sequence.

[0773] For example, in the locations that have only Oligomer 1 conjugated thereto, the mass spectrum revealed a predominate signal with an observed experimental mass-to-charge ratio of 7072.4 approximately equal to that of the 23-mer; the theoretical mass-to-charge ratio of the 23-mer is 7072.6 Da. Similarly, specific hybridization of the 12-mer oligonucleotide to the array, observed experimental mass-to-charge ratio of 3618.33 Da (theoretical 3622.4 Da), was detected only at those locations conjugated with Oligomer 2 whereas specific hybridization of MJM6 (observed experimental mass-to-charge ratio of 6415.4) was detected only at those locations of the array conjugated with Oligomer 3 [theoretical 6407.2 Da].

[0774] None of the locations of the array revealed a signal that corresponds to the negative control 29-mer oligonucleotide (theoretical mass-to-charge ratio of 8974.8) indicating that specific target DNA molecules can be hybridized to oligomers covalently immobilized to specific locations on the surface of the silicon array and a plurality of hybridization assays may be individually monitored using MALDI-TOF MS analysis.

[0775] (b) 8×8 (64-location) array

[0776] A 2×2 cm² silicon wafer having 256 individual depressions or wells that form a 16×16 array of wells was purchased from a commercial supplier [Accelerator Technology Corp., College Station, Tex.]. The wells were 800×800 μm², 120 μm deep, on a 1.125 pitch. The silicon wafer was reacted with 3-aminopropyltriethoxysilane to produce a uniform layer of primary amines on the surface and then exposed to the heterobifunctional crosslinker SIAB resulting in iodoacetamido functionalities on the surface as described above.

[0777] To make an array of 64 elements, a pintool was used following the procedures described above. The pintool was dipped into 16 wells of a 384 well DNA source plate containing solutions of Oligomers 1-3, moved to the silicon wafer, and the sample spotted by surface contact. Next, the tool was dipped in washing solution, then dipped into the same 16 wells of the source plate, and spotted onto the target 2.25 mm offset from the initial set of 16 spots; the entire cycle was repeated to make a 2×2 array from each pin to produce an 8×8 array of spots (2×2 elements/pin×16 pins=64 total elements spotted).

[0778] Oligomers 1-3 immobilized to the 64 locations were hybridized to complementary oligonucleotides and analyzed by MALDI-TOF MS analysis. As observed for the 16-location array, specific hybridization of the complementary oligonucleotide to each of the immobilized thiol-containing oligomers was observed in each of the locations of the DNA array.

EXAMPLE 30

[0779] Extension of hybridized DNA primers bound to DNA templates immobilized on a silicon wafer

[0780] The SIAB-derivatized silicon wafers can also be employed for primer extension reactions of the immobilized DNA template using the procedures essentially described in EXAMPLE 7.

[0781] A 27-mer oligonucleotide [SEQ ID NO: 125] containing a 3′-free thiol group was coupled to a SIAB-derivatized silicon wafer as described above, for example, in EXAMPLE 28. A 12-mer oligonucleotide primer [SEQ ID NO: 126] was hybridized to the immobilized oligonucleotide and the primer was extended using a commercially available kit [e.g., Sequenase or ThermoSequenase, U.S. Biochemical Corp]. The addition of Sequenase DNA polymerase or ThermoSequenase DNA polymerase in the presence of three deoxyribonucleoside triphosphates (dNTPs; dATP, dGTP, dCTP) and dideoxyribonucleoside thymidine triphosphate (ddTTP) in buffer according to the instructions provided by the manufacturer resulted in a 3-base extension of the 12-mer primer while still bound to the silicon wafer. The wafer was then analyzed by MALDI-TOF mass spectrometry as described above. The mass spectrum results clearly distinguish the 15-mer [SEQ ID NO: 127] from the original unextended 12-mer thus indicating that specific extension can be performed on the surface of a silicon wafer and detected using MALDI-TOF MS analysis.

EXAMPLE 31

[0782] Effect of linker length on polymerase extension of hybridized DNA primers bound to DNA templates immobilized on a silicon wafer

[0783] The effect of the distance between the SlAB-conjugated silicon surface and the duplex DNA formed by hybridization of the target DNA to the immobilized oligomer template was investigated, as well as choice of enzyme.

[0784] Two SIAB-derivatized silicon wafers were conjugated to the 3′-end of two free thiol-containing oligonucleotides of identical DNA sequence except for a 3-base poly dT spacer sequence incorporated at the 3′-end: CTGATGCGTC GGATCATCTT TTTT SEQ ID No. 122 CTGATGCGTC GGATCATCTT TTTTTTT SEQ ID No. 125.

[0785] These oligonucleotides were synthesized and each was separately immobilized to the surface of a silicon wafer through the SIAB cross-linker [e.g., see EXAMPLE 28]. Each wafer was incubated with a 12-mer oligonucleotide: AAAAAAGATG AT SEQ ID No. 126 GATGATCCGA CG SEQ ID No. 128 GATCCGACGC AT SEQ ID No. 129,

[0786] which is complementary to portions of the nucleotide sequences common to both of the oligonucleotides, by denaturing at 75 ° C. and slow cooling the silicon wafer. The wafers were then analyzed by MALDI-TOF mass spectrometry as described above.

[0787] As described in EXAMPLE 30 above, a 3-base specific extension of the bound 12-mer oligonucleotide was observed using the oligomer primer where there is a 9-base spacer between the duplex and the surface [SEQ ID NO: 125]. Similar results were observed when the DNA spacer lengths between the SIAB moiety and the DNA duplex were 0, 3, 6 and 1 2. In addition, the extension reaction may be performed using a variety of DNA polymerases, such as Sequenase and Thermo Sequenase (US Biochemical). Thus, the SIAB linker may be directly coupled to the DNA template or may include a linker sequence without effecting primer extension of the hybridized DNA.

EXAMPLE 32

[0788] Spectrochip mutant detection in ApoE gene

[0789] This example describes the hybridization of an immobilized template, primer extension and mass spectrometry for detection of the wildtype and mutant Apolipoprotein E gene for diagnostic purposes. This example demonstrates that immobilized DNA molecules containing a specific sequence can be detected and distinguished using primer extension of unlabeled allele specific primers and analysis of the extension products using mass spectrometry.

[0790] A 50 base synthetic DNA template complementary to the coding sequence of allele 3 of the wildtype apolipoprotein E gene: 5′-GCCTGGTACACTGCCAGGCGCTTCTGCAGGTCATCGGCATCGCGGAGGAG-3′ [SEQ ID NO: 280]

[0791] or complement to the mutant apolipoprotein E gene carrying a G→A transition at codon 158: 5′-GCCTGGTACACTGCCAGGCACTTCTGCAGGTCATCGGCATCGCGGAGGAG-3′ [SEQ ID NO: 281]

[0792] containing a 3′-free thiol group was coupled to separate SIAB-derivatized silicon wafers as described in Example 28.

[0793] A 21-mer oligonucleotide primer: 5′-GAT GCC GAT GAC CTG CAG AAG-3′ [SEQ ID NO: 282] was hybridized to each of the immobilized templates and the primer was extended using a commercially available kit [e.g., Sequenase or Thermosequenase, U.S. Biochemical Corp]. The addition of Sequenase DNA polymerase or Thermosequenase DNA polymerase in the presence of three deoxyribonucleoside triphosphates (dNTPs; dATP, dGTP, dTTP) and dideoxyribonucleoside cytosine triphosphate (ddCTP) in buffer according to the instructions provided by the manufacturer resulted in a single base extension of the 21 -mer primer bound to the immobilized template encoding the wildtype apolipoprotein E gene and a three base extension of the 21-mer primer bound to the immobilized template encoding the mutant form of apolipoprotein E gene.

[0794] The wafers were analyzed by mass spectrometry as described herein. The wildtype apolipoprotein E sequence results in a mass spectrum that distinguishes the primer with a single base extension (22-mer) with a mass to charge ratio of 6771.1 7 Da (the theoretical mass to charge ratio is 6753.5 Da) from the original 21-mer primer with a mass to charge ration of 6499.64 Da. The mutant apolipoprotein E sequence results in a mass spectrum that distinguishes the primer with a three base extension (24-mer) with a mass to charge ratio of 7386.9 (the theoretical mass charge is 7386.9) from the original 21-mer primer with a mass to charge ration of 6499.64 Da.

EXAMPLE 33

[0795] Detection of Double-Stranded Nucleic Acid Molecules via Strand Displacement and Hybridization to an Immobilized Complementary Nucleic Acid

[0796] This example describes immobilization of a 24-mer primer and the specific hybridization of one strand of a duplex DNA molecule, thereby permitting amplication of a selected target molecule in solution phase and permitting detection of the double stranded molecule. This method is useful for detecting single base changes, and, particularly for screening genomic libraries of double-stranded fragments.

[0797] A 24-mer DNA primer CTGATGCGTC GGATCATCTT TTTT SEQ ID No. 122, containing a 3′-free thiol group was coupled to a SIAB-derivatized silicon wafer as described in Example 29.

[0798] An 18-mer synthetic oligonucleotide: 5′-CTGATGCGTCGGATCATC-3′ [SEQ ID NO: 286] was premixed with a 12-mer 5′-GATGATCCGACG-3′ [SEQ ID NO: 285] that has a sequence that is complementary to 12 base portion of the 18-mer oligonucleotide. The oligonucleotide mix was heated to 75° C. and cooled slowly to room temperature to faciliate the formation of a duplex molecule: 5′-CTGATGCGTCGGATCATC-3′ [SEQ ID NO. 286]      3′- GCAGCCTAGTAG-5′ [SEQ ID NO: 287].

[0799] The specific hybridization of the 12-mer strand of the duplex molecule to the immobilized 24-mer primer was carried out by mixing 1 μM of the duplex molecule using the hybridization conditions described in Example 30.

[0800] The wafers were analyzed by mass spectrometry as described above. Specific hybridization was detected in a mass spectrum of the 12-mer with a mass to charge ratio of 3682.78 Da.

EXAMPLE 34

[0801] 1-(2-Nitro-5-(3-O-4,4′-dimethoxytritylpropoxy)phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane

[0802] A. 2-Nitro-5-(3-hydroxypropoxy)benzaldehyde

[0803] 3-Bromo-1-propanol (3.34 g, 24 mmol) was refluxed in 80 ml of anhydrous acetonitrile with 5-hydroxy-2-nitrobenzaldehyde (3.34 g, 20 mmol), K₂CO₃ (3.5 g), and Kl (100 mg) overnight (15 h). The reaction mixture was cooled to room temperature and 150 ml of methylene chloride was added. The mixture was filtered and the solid residue was washed with methylene chloride. The combined organic solution was evaporated to dryness and redissolved in 100 ml methylene chloride. The resulted solution was washed with saturated NaCl solution and dried over sodium sulfate. 4.31 g (96%) of desired product was obtained after removal of the solvent in vacuo.

[0804] Rf=0.33 (dichloromethane/methanol, 95/5).

[0805] UV (methanol) maximum: 313, 240 (shoulder), 215 nm; minimum: 266 nm.

[0806]¹H NMR (DMSO-d₆) δ10.28 (s, 1H), 8.17 (d, 1H), 7.35 (d, 1H), 7.22 (s, 1H), 4.22(t, 2H), 3.54 (t, 2H), 1.90 (m, 2H).

[0807]¹³C NMR (DMSO-d₆) δ189.9, 153.0, 141.6, 134.3, 127.3, 118.4, 114.0, 66.2, 56.9, 31.7.

[0808] B. 2-Nitro-5-(3-O-t-butyidimethylsilylpropoxy)benzaldehyde

[0809] 2-Nitro-5-(3-hydroxypropoxy)benzaldehyde(1 g, 4.44 mmol) was dissolved in 50 ml anhydrous acetonitrile. To this solution, it was added 1 ml of triethylamine, 200 mg of imidazole, and 0.8 g (5.3 mmol) of tBDMSCl. The mixture was stirred at room temperature for 4 h. Methanol (1 ml) was added to stop the reaction. The solvent was removed in vacuo and the solid residue was redissolved in 100 ml methylene chloride. The resulted solution was washed with saturated sodium bicarbonate solution and then water. The organic phase was dried over sodium sulfate and the solvent was removed in vacuo. The crude mixture was subjected to a quick silica gel column with methylene chloride to yield 1.44 g (96%) of 2-nitro-5-(3-O-t-butyldimethylsilylpropoxy) benzaldehyde.

[0810] R_(f)=0.67 (hexane/ethyl acetate, 5/1).

[0811] UV (methanol), maximum: 317, 243, 215 nm; minimum: 235, 267 nm.

[0812]¹H NMR (DMSO-d₆) δ10.28 (s, 1H), 8.14 (d, 1H), 7.32 (d, 1H), 7.20 (s, 1H), 4.20 (t, 2H), 3.75 (t, 2H), 1.90 (m, 2H), 0.85 (s, 9H), 0.02 (s, 6H).

[0813]¹³C NMR (DMSO-d₆) δ189.6, 162.7, 141.5, 134.0, 127.1, 118.2, 113.8, 65.4, 58.5, 31.2, 25.5, −3.1, −5.7.

[0814] C. 1-(2-Nitro-5-(3-O-t-butyidimethylsilylpropoxy)phenyl)ethanol

[0815] High vacuum dried 2-nitro-5-(3-O-t-butyldimethylsilylpropoxy)benzaldehyde (1.02 g, 3 mmol) was dissolved 50 ml of anhydrous methylene chloride. 2 M Trimethylaluminium in toluene (3 ml) was added dropwise within 10 min and keeped the reaction mixture at room temperature. It was stirred further for 10 min and the mixture was poured into 10 ml ice cooled water. The emulsion was separated from water phase and dried over 100 g of sodium sulfate to remove the remaining water. The solvent was removed in vacuo and the mixture was applied to a silica gel column with gradient methanol in methylene chloride. 0.94 g (86%) of desired product was isolated.

[0816] R_(f)=0.375 (hexane/ethyl acetate, 5/1).

[0817] UV (methanol), maximum: 306, 233, 206 nm; minimum: 255, 220 nm.

[0818]¹H NMR (DMSO-d₆) δ8.00 (d, 1H), 7.36 (s, 1H), 7.00 (d, 1 H), 5.49 (b, OH), 5.31 (q, 1H), 4.19 (m, 2H), 3.77 (t, 2H), 1.95 (m, 2H), 1.37 (d, 3H), 0.86 (s, 9H), 0.04 (s, 6H).

[0819]¹³C NMR (DMSO-d₆) δ162.6, 146.2, 139.6, 126.9, 112.9, 112.5, 64.8, 63.9, 58.7, 31.5, 25.6, 24.9, −3.4, −5.8.

[0820] D. 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol

[0821] 1-(2-Nitro-5-(3-O-t-butyldimethylsilylpropoxy)phenyl)ethanol (0.89 g, 2.5 mmol) was dissolved in 30 ml of THF and 0.5 mmol of nBu₄NF was added under stirring. The mixture was stirred at room temperature for 5 h and the solvent was removed in vacuo. The remaining residue was applied to a silica gel column with gradient methanol in methylene chloride. 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol (0.6 g (99%) was obtained.

[0822] R_(f)=0.17 (dichloromethane/methanol, 95/5).

[0823] UV (methanol), maximum: 304, 232, 210 nm; minimum: 255, 219 nm.

[0824]¹H NMR (DMSO-d₆) δ8.00 (d, 1H), 7.33 (s, 1H), 7.00 (d, 1H), 5.50 (d, OH), 5.28 (t, OH), 4.59 (t, 1H), 4.17 (t, 2H), 3.57 (m, 2H), 1.89 (m, 2H), 1.36 (d, 2H).

[0825]¹³C NMR (DMOS-d₆) δ162.8, 146.3, 139.7, 127.1, 113.1, 112.6, 65.5, 64.0, 57.0, 31.8, 25.0.

[0826] E. 1-(2-Nitro-5-(3-O-4,4′-dimethoxytritylpropoxy)phenyl)ethanol

[0827] 1-(2-Nitro-5-(3-hydroxypropoxy)phenyl)ethanol (0.482 g, 2 mmol) was co-evaporated with anhydrous pyridine twice and dissolved in 20 ml anhydrous pyridine. The solution was cooled in ice-water bath and 750 mg (2.2 mmol) of DMTCI was added. The reaction mixture was stirred at room temperature overnight and 0.5 ml methanol was added to stop the reaction. The solvent was removed in vacuo and the residue was co-evaporated with toluene twice to remove trace of pyridine. The final residue was applied to a silica gel column with gradient methanol in methylene chloride containing drops of triethylamine to yield 0.96 g (89%) of the desired product 1-(2-nitro-5-(3-O-4,4′-dimethoxytrityl-propoxy)phenyl)ethanol.

[0828] R_(f)=0.50 (dichloromethane/methanol, 99/1).

[0829] UV (methanol), maximum: 350 (shoulder), 305, 283, 276 (shoulder), 233, 208 nm; minimum: 290, 258, 220 nm.

[0830]¹H NMR (DMSO-d₆) δ8.00 (d, 1H), 6.82-7.42 (ArH), 5.52 (d, OH), 5.32 (m, 1H), 4.23 (t, 2H), 3.71 (s, 6H), 3.17 (t, 2H), 2.00 (m, 2H), 1.37 (d, 3H).

[0831]¹³C NMR (DMOS-d₆) δ162.5, 157.9, 157.7, 146.1, 144.9, 140.1, 139.7, 135.7, 129.5, 128.8, 127.6, 127.5, 127.3, 126.9, 126.4, 113.0, 112.8, 112.6, 85.2, 65.3, 63.9, 59.0, 54.8, 28.9, 24.9.

[0832] F. 1-(2-Nitro-5-(3-O-4,4′-dimethoxytritylpropoxy)phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane

[0833] 1-(2-Nitro-5-(3-O-4,4′-dimethoxytritylpropoxy)phenyl)ethanol (400 mg, 0.74 mmol) was dried under high vacuum and was dissolved in 20 ml of anhydrous methylene chloride. To this solution, it was added 0.5 ml N,N-diisopropylethylamine and 0.3 ml (1.34 mmol) of 2-cyanoethyl-N,N-diisopropylchlorophosphoramidite. The reaction mixture was stirred at room temperature for 30 min and 0.5 ml of methanol was added to stop the reaction. The mixture was washed with saturated sodium bicarbonate solution and was dried over sodium sulfate. The solvent was removed in vacuo and a quick silica gel column with 1% methanol in methylene chloride containing drops of triethylamine yield 510 mg (93%) the desired phosphoramidite.

[0834] R_(f)=0.87 (dichloromethane/methanol, 99/1).

EXAMPLE 35

[0835] 1-(4-(3-O-4,4′-Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane

[0836] A. 4-(3-Hydroxypropoxy)-3-methoxyacetophenone

[0837] 3-Bromo-1-propanol (53 ml, 33 mmol) was refluxed in 100 ml of anhydrous acetonitrile with 4-hydroxy-3-methoxyacetophenone (5 g, 30 mmol), K₂CO₃ (5 g), and Kl (300 mg) overnight (15 h). Methylenechloride (150 ml) was added to the reaction mixture after cooling to room temperature. The mixture was filtered and the solid residue was washed with methylene chloride. The combined organic solution was evaporated to dryness and redissolved in 100 ml methylene chloride. The resulted solution was washed with saturated NaCl solution and dried over sodium sulfate. 6.5 g (96.4%) of desired product was obtained after removal of the solvent in vacuo.

[0838] R_(f)=0.41 (dichloromethane/methanol, 95/5).

[0839] UV (methanol), maximum: 304, 273, 227, 210 nm: minimum: 291, 244, 214 nm.

[0840]¹H NMR (DMSO-d₆) δ7.64 (d, 1H), 7.46 (s, 1H), 7.04 (d, 1H), 4.58 (b, OH), 4.12 (t, 2H), 3.80 (s, 3H), 3.56 (t, 2H), 2.54 (s, 3H), 1.88 (m, 2H).

[0841]¹³C NMR (DMSO-d₆) δ196.3, 152.5, 148.6, 129.7, 123.1, 111.5, 110.3, 65.4, 57.2, 55.5, 31.9, 26.3.

[0842] B. 4-(3-Acetoxypropoxy)-3-methoxyacetophenone

[0843] 4-(3-Hydroxypropoxy)-3-methoxyacetophenone (3.5 g, 15.6 mmol) was dried and dissolved in 80 ml anhydrous acetonitrile. This mixture, 6 ml of triethylamine and 6 ml of acetic anhydride were added. After 4 h, 6 ml methanol was added and the solvent was removed in vacuo. The residue was dissolved in 100 ml dichloromethane and the solution was washed with dilute sodium bicarbonate solution, then water. The organic phase was dried over sodium sulfate and the solvent was removed. The solid residue was applied to a silica gel column with methylene chloride to yield 4.1 g of 4-(3-acetoxypropoxy)-3-methoxyacetophenone (98.6%).

[0844] R_(f)=0.22 (dichloromethane/methanol, 99/1).

[0845] UV (methanol), maximum: 303, 273, 227, 210 nm; minimum: 290, 243, 214 nm.

[0846]¹H NMR (DMSO-d₆) δ7.62 (d, 1H), 7.45 (s, 1H), 7.08 (d, 1H), 4.12 (m, 4H, 3.82 (s, 3H), 2.54 (s, 3H), 2.04 (m, 2H), 2.00 (s, 3H).

[0847]¹³C NMR (DMSO-d₆) δ196.3, 170.4, 152.2, 148.6, 130.0, 123.0, 111.8, 110.4, 65.2, 60.8, 55.5, 27.9, 26.3, 20.7.

[0848] C. 4-(3-Acetoxypropoxy)-3-methoxy-6-nitroacetophenone

[0849] 4-(3-Acetoxypropoxy)-3-methoxyacetophenone (3.99 g, 15 mmol) was added portionwise to 15 ml of 70% HNO₃ in water bath and keep the reaction temperature at the room temperature. The reaction mixture was stirred at room temperature for 30 min and 30 g of crushed ice was added. This mixture was extracted with 100 ml of dichloromethane and the organic phase was washed with saturated sodium bicarbonate solution. The solution was dried over sodium sulfate and the solvent was removed in vacuo. The crude mixture was applied to a silica gel column with gradient methanol in methylene chloride to yield 3.8 g (81.5%) of desired product 4-(3-acetoxypropoxy)-3-methoxy-6-nitroacetophenone and 0.38 g (8%) of ipso-substituted product 5-(3-acetoxypropoxy)-4-methoxy-1,2-dinitrobenzene. Side ipso-substituted product 5-(3-acetoxypropoxy)-4-methoxy-1,2-dinitrobenzene:

[0850] R_(f)=0.47 (dichloromethane/methanol, 99/1).

[0851] UV (methanol), maximum: 334, 330, 270, 240, 212 nm; minimum: 310, 282, 263, 223 nm. ¹H NMR (CDCl₃) δ7.36 (s, 1H), 7.34 (s, 1H), 4.28 (t, 2H), 4.18 (t, 2H), 4.02 (s, 3H), 2.20 (m, 2H), 2.08 (s, 3H).

[0852]¹³C NMR (CDCl³) δ170.9, 152.2, 151.1, 117.6, 111.2, 107.9, 107.1, 66.7, 60.6, 56.9, 28.2, 20.9.

[0853] Desired product 4-(3-acetoxypropoxy)-3-methoxy-6-nitroacetophenone:

[0854] R_(f)=0.29 (dichloromethane/methanol, 99/1).

[0855] UV (methanol), maximum: 344, 300, 246, 213 nm; minimum: 320, 270, 227 nm.

[0856]¹H NMR (CDCl₃) δ7.62 (s, 1H), 6.74 (s, 1H), 4.28 (t, 2H), 4.20 (t, 2H), 3.96 (s, 3H), 2.48 (s, 3H), 2.20 (m, 2H), 2.08 (s, 3H).

[0857]¹³C NMR (CDCl₃) δ200.0, 171.0, 154.3, 148.8, 138.3, 133.0, 108.8, 108.0, 66.1, 60.8, 56.6, 30.4, 28.2, 20.9.

[0858] D. 1-(4-(3-Hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol

[0859] 4-(3-Acetoxypropoxy)-3-methoxy-6-nitroacetophenone (3.73 g, 12 mmol) was added 150 ml ethanol and 6.5 g of K₂CO₃. The mixture was stirred at room temperature for 4 h and TLC with 5% methanol in dichloromethane indicated the completion of the reaction. To this same reaction mixture, it was added 3.5 g of NaBH₄ and the mixture was stirred at room temperature for 2 h. Acetone (10 ml) was added to react with the remaining NaBH₄. The solvent was removed in vacuo and the residue was uptaken into 50 g of silica gel. The silica gel mixture was applied on the top of a silica gel column with 5% methanol in methylene chloride to yield 3.15 g (97%) of desired product 1-(4-(3-hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol.

[0860] Intermediate product 4-(3-hydroxypropoxy)-3-methoxy-6-nitroacetophenone after deprotection:

[0861] R_(f)=0.60 (dichloromethane/methanol, 95/5).

[0862] Final product 1-(4-(3-hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol:

[0863] R_(f)=0.50 (dichloromethane/methanol, 95/5).

[0864] UV (methanol), maximum: 344, 300, 243, 219 nm: minimum: 317, 264, 233 nm.

[0865]¹H NMR (DMSO-d₆) δ7.54 (s, 1H), 7.36 (s, 1H), 5.47 (d, OH), 5.27 (m, 1H), 4.55 (t, OH), 4.05 (t, 2H), 3.90 (s, 3H), 3.55 (q, 2H), 1.88 (m, 2H), 1.37 (d, 3H).

[0866]¹³C NMR (DMSO-d₆) δ153.4, 146.4, 138.8, 137.9, 109.0, 108.1, 68.5, 65.9, 57.2, 56.0, 31.9, 29.6.

[0867] E. 1-(4-(3-O-4,4′-Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol

[0868] 1-(4-(3-Hydroxypropoxy)-3-methoxy-6-nitrophenyl)ethanol (0.325 g, 1.2 mmol) was co-evaporated with anhydrous pyridine twice and dissolved in 15 ml anhydrous pyridine. The solution was cooled in ice-water bath and 450 mg (1.33 mmol) of DMTCI was added. The reaction mixture was stirred at room temperature overnight and 0.5 ml methanol was added to stop the reaction. The solvent was removed in vacuo and the residue was co-evaporated with toluene twice to remove trace of pyridine. The final residue was applied to a silica gel column with gradient methanol in methylene chloride containing drops of triethylamine to yield 605 mg (88%) of desired product 1-(4-(3-O-4,4′-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol.

[0869] R_(f)=0.50 (dichloromethane/methanol, 95/5).

[0870] UV (methanol), maximum: 354, 302, 282, 274, 233, 209 nm; minimum: 322, 292, 263, 222 nm.

[0871]¹H NMR (DMSO-d₆) δ7.54 (s, 1H), 6.8-7.4 (ArH), 5.48 (d, OH), 5.27 (m, 1H), 4.16 (t, 2H), 3.85 (s, 3H), 3.72 (s, 6H), 3.15 (t, 2H), 1.98 (t, 2H), 1.37 (d, 3H).

[0872]¹³C NMR (DMSO-d₆) δ157.8, 153.3, 146.1, 144.9, 138.7, 137.8, 135.7, 129.4, 128.7, 127.5, 127.4, 126.3, 112.9, 112.6, 108.9, 108.2, 85.1, 65.7, 63.7, 59.2, 55.8, 54.8, 29.0, 25.0.

[0873] F. 1-(4-(3-O-4,4′-Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane

[0874] 1-(4-(3-O-4,4′-Dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)ethanol (200 mg, 3.5 mmol) was dried under high vacuum and was dissolved in 15 ml of anhydrous methylene chloride. To this solution, it was added 0.5 ml N,N-diisopropylethylamine and 0.2 ml (0.89 mmol) of 2-cyanoethyl-N,N-diisopropylchlorophosphoramidite. The reaction mixture was stirred at room temperature for 30 min and 0.5 ml of methanol was added to stop the reaction. The mixture was washed with saturated sodium bicarbonate solution and was dried over sodium sulfate. The solvent was removed in vacuo and a quick silica gel column with 1% methanol in methylene chloride containing drops of triethylamine yield 247 mg (91.3%) the desired phosphoramidite 1-(4-(3-O-4,4′-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane.

[0875] R_(f)=0.87 (dichloromethane/methanol, 99/1).

EXAMPLE 36

[0876] Oligonucleotide synthesis

[0877] The oligonucleotide conjugates containing photocleavable linker were prepared by solid phase nucleic acid synthesis (see: Sinha et al. Tetrahedron Lett. 1983, 24, 5843-5846; Sinha et al. Nucleic Acids Res. 1984, 12, 4539-4557; Beaucage et al. Tetrahedron 1993, 49, 6123-6194; and Matteucci et al. J. Am. Chem. Soc. 1981, 103, 3185-3191) under standard conditions. In addition a longer coupling time period was employed for the incorporation of photocleavable unit and the 5′ terminal amino group. The coupling efficiency was detected by measuring the absorbance of released DMT cation and the results indicated a comparable coupling efficiency of phosphoramidite 1-(2-nitro-5-(3-O-4,4′-dimethoxytritylpropoxy)phenyl)-1 -O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane or 1-(4-(3-O-4,4′-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)-1 -O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane with those of common nucleoside phosphoramodites. Deprotection of the base protection and release of the conjugates from the solid support was carried out with concentrated ammonium at 55° C. overnight. Deprotection of the base protection of other conjugates was done by fast deprotection with AMA reagents. Purification of the MMT-on conjugates was done by HPLC (trityl-on) using 0.1 M triethylammonium acetate, pH 7.0 and a gradient of acetonitrile (5% to 25% in 20 minutes). The collected MMT or DMT protected conjugate was reduced in volume, detritylated with 80% aqueous acetic acid (40 min, 0° C.), desalted, stored at −20° C.

EXAMPLE 37

[0878] Photolysis study

[0879] In a typical case, 2 nmol of oligonucleotide conjugate containing photocleavable linker in 200 μl distilled water was irradiated with a long wavelength UV lamp (Blak Ray XX-15 UV lamp, Ultraviolet products, San Gabriel, Calif.) at a distance of 10 cm (emission peak 365 nm, lamp intensity=1.1 mW/cm² at a distance of 31 cm). The resulting mixture was analyzed by HPLC (trityl-off) using 0.1 M triethylammonium acetate, pH 7.0 and a gradient of acetonitrile. Analysis showed that the conjugate was cleaved from the linder within minutes upon UV irradiation.

[0880] Equivalents

[0881] Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the scope of this invention and are covered by the following claims.

0 SEQUENCE LISTING (1) GENERAL INFORMATION: (iii) NUMBER OF SEQUENCES: 320 (2) INFORMATION FOR SEQ ID NO: 1: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: GCAAGTGAAT CCTGAGCGTG 20 (2) INFORMATION FOR SEQ ID NO: 2: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: GTGTGAAGGG TTCATATGC 19 (2) INFORMATION FOR SEQ ID NO: 3: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: ATCTATATTC ATCATAGGAA ACACCACA 28 (2) INFORMATION FOR SEQ ID NO: 4: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: GTATCTATAT TCATCATAGG AAACACCATT 30 (2) INFORMATION FOR SEQ ID NO: 5: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: GCTTTGGGGC ATGGACATTG ACCCGTATAA 30 (2) INFORMATION FOR SEQ ID NO: 6: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: CTGACTACTA ATTCCCTGGA TGCTGGGTCT 30 (2) INFORMATION FOR SEQ ID NO: 7: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: TTGCCTGAGT GCAGTATGGT 20 (2) INFORMATION FOR SEQ ID NO: 8: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: AGCTCTATAT CGGGAAGCCT 20 (2) INFORMATION FOR SEQ ID NO: 9: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: TTGTGCCACG CGGTTGGGAA TGTA 24 (2) INFORMATION FOR SEQ ID NO: 10: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: AGCAACGACT GTTTGCCCGC CAGTTG 26 (2) INFORMATION FOR SEQ ID NO: 11: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: TACATTCCCA ACCGCGTGGC ACAAC 25 (2) INFORMATION FOR SEQ ID NO: 12: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: AACTGGCGGG CAAACAGTCG TTGCT 25 (2) INFORMATION FOR SEQ ID NO: 13: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: GCAAGTGAAT CCTGAGCGTG 20 (2) INFORMATION FOR SEQ ID NO: 14: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 14 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: GTGTGAAGGG CGTG 14 (2) INFORMATION FOR SEQ ID NO: 15: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: CTATATTCAT CATAGGAAAC ACCA 24 (2) INFORMATION FOR SEQ ID NO: 16: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: GTCACCCTCG ACCTGCAG 18 (2) INFORMATION FOR SEQ ID NO: 17: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: TTGTAAAACG ACGGCCAGT 19 (2) INFORMATION FOR SEQ ID NO: 18: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: CTTCCACCGC GATGTTGA 18 (2) INFORMATION FOR SEQ ID NO: 19: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: CAGGAAACAG CTATGAC 17 (2) INFORMATION FOR SEQ ID NO: 20: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: GTAAAACGAC GGCCAGT 17 (2) INFORMATION FOR SEQ ID NO: 21: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: GTCACCCTCG ACCTGCAGC 19 (2) INFORMATION FOR SEQ ID NO: 22: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: GTTGTAAAAC GAGGGCCAGT 20 (2) INFORMATION FOR SEQ ID NO: 23: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: TCTGGCCTGG TGCAGGGCCT ATTGTAGTTG TGACGTACA 39 (2) INFORMATION FOR SEQ ID NO: 24: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 14 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: TGTACGTCAC AACT 14 (2) INFORMATION FOR SEQ ID NO: 25: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 78 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: AAGATCTGAC CAGGGATTCG GTTAGCGTGA CTGCTGCTGC TGCTGCTGCT GCTGGATGAT 60 CCGACGCATC AGATCTGG 78 (2) INFORMATION FOR SEQ ID NO: 26: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: CTGATGCGTC GGATCATC 18 (2) INFORMATION FOR SEQ ID NO: 27: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: GATGATCCGA CGCATCACAG CTC 23 (2) INFORMATION FOR SEQ ID NO: 28: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: TCGGTTCCAA GAGCTGTGAT GCGTCGGATC ATC 33 (2) INFORMATION FOR SEQ ID NO: 29: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: GATGATCCGA CGCATCACAG CTC 23 (2) INFORMATION FOR SEQ ID NO: 30: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: GTGATGCGTC GGATCATC 18 (2) INFORMATION FOR SEQ ID NO: 31: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: TCGGTTCCAA GAGCT 15 (2) INFORMATION FOR SEQ ID NO: 32: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: CATTTGCTTC TGACACAACT G 21 (2) INFORMATION FOR SEQ ID NO: 33: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: CTTCTCTGTC TCCACATGC 19 (2) INFORMATION FOR SEQ ID NO: 34: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: TGCACCTGAC TC 12 (2) INFORMATION FOR SEQ ID NO: 35: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: TGCTTACTTA ACCCAGTGTG 20 (2) INFORMATION FOR SEQ ID NO: 36: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: CACACTATGT AATACTATGC 20 (2) INFORMATION FOR SEQ ID NO: 37: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: GAAAATATCT GACAAACTCA TC 22 (2) INFORMATION FOR SEQ ID NO: 38: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: CATGGACACC AAATTAAGTT C 21 (2) INFORMATION FOR SEQ ID NO: 39: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 14 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: TGAGACTCTG TCTC 14 (2) INFORMATION FOR SEQ ID NO: 40: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: TTCCCCAAAT CCCTG 15 (2) INFORMATION FOR SEQ ID NO: 41: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: GGCACGGCTG TCCAAGGAG 19 (2) INFORMATION FOR SEQ ID NO: 42: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: AGGCCGCGCT CGGCGCCCTC 20 (2) INFORMATION FOR SEQ ID NO: 43: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: GCGGACATGG AGGACGTG 18 (2) INFORMATION FOR SEQ ID NO: 44: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: GATGCCGATG ACCTGCAGAA G 21 (2) INFORMATION FOR SEQ ID NO: 45: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: CCCTTACCCT TACCCTTACC CTAA 24 (2) INFORMATION FOR SEQ ID NO: 46: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: AATCCGTGCA GCAGAGTT 18 (2) INFORMATION FOR SEQ ID NO: 47: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: TGTCAGAGCT GGACAAGTGT 20 (2) INFORMATION FOR SEQ ID NO: 48: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: GATATTGTCT TCCCGGTAGC 20 (2) INFORMATION FOR SEQ ID NO: 49: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: CTCGGACCAG GTGTACCGCC 20 (2) INFORMATION FOR SEQ ID NO: 50: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: CCTGTACTGG AAGGCGATCT C 21 (2) INFORMATION FOR SEQ ID NO: 51: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: CATGAGGCAG AGCATACGCA 20 (2) INFORMATION FOR SEQ ID NO: 52: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: GACAGCAGCA CCGAGACGAT 20 (2) INFORMATION FOR SEQ ID NO: 53: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: CGGCTGCGAT CACCGTGCGG 20 (2) INFORMATION FOR SEQ ID NO: 54: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: GATCCACTGT GCGACGAGC 19 (2) INFORMATION FOR SEQ ID NO: 55: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: GCGGCTGCGA TCACCGTGC 19 (2) INFORMATION FOR SEQ ID NO: 56: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: TGCACCTGAC TC 12 (2) INFORMATION FOR SEQ ID NO: 57: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: CTGTGGTCGT GC 12 (2) INFORMATION FOR SEQ ID NO: 58: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: GAGTCAGGTG CGCCATGCCT CAAACAGACA CCATGGCGC 39 (2) INFORMATION FOR SEQ ID NO: 59: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: TCTCTGTCTC CACATGCCCA G 21 (2) INFORMATION FOR SEQ ID NO: 60: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 61 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATTTTCCATA 60 C 61 (2) INFORMATION FOR SEQ ID NO: 61: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: AACTAAGCCA TGTGCACAAC A 21 (2) INFORMATION FOR SEQ ID NO: 62: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: RNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: UCCGGUCUGA UGAGUCCGUG AGGAC 25 (2) INFORMATION FOR SEQ ID NO: 63: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: RNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: GUCACUACAG GUGAGCUCCA 20 (2) INFORMATION FOR SEQ ID NO: 64: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: RNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: CCAUGCGAGA GUAAGUAGUA 20 (2) INFORMATION FOR SEQ ID NO: 65: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 50 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: RNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: AGGCCUGCGG CAAGACGGAA AGACCAUGGU CCCUNAUCUG CCGCAGGAUC 50 (2) INFORMATION FOR SEQ ID NO: 66: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: CATTTGCTTC TGACACAACT 20 (2) INFORMATION FOR SEQ ID NO: 67: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: TCTCTGTCTC CACATGCCCA G 21 (2) INFORMATION FOR SEQ ID NO: 68: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: GTCGTCCCAT GGTGCACCTG ACTC 24 (2) INFORMATION FOR SEQ ID NO: 69: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: CGCTGTGGTG AGGCCCTGGG CA 22 (2) INFORMATION FOR SEQ ID NO: 70: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: GACGACGACT GCTACCTGAC TCCA 24 (2) INFORMATION FOR SEQ ID NO: 71: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: ACAGCGGACT GCTACCTGAC TCCA 24 (2) INFORMATION FOR SEQ ID NO: 72: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: TGGAGTCAGG TAGCAGTC 18 (2) INFORMATION FOR SEQ ID NO: 73: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: CAGCTCTCAT TTTCCATAC 19 (2) INFORMATION FOR SEQ ID NO: 74: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: AGCCCCAAGA TGACTATC 18 (2) INFORMATION FOR SEQ ID NO: 75: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: CGAGGAGCTC AAGGCCAGAA T 21 (2) INFORMATION FOR SEQ ID NO: 76: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: CAGGGGCAGC TCAGCTCTC 19 (2) INFORMATION FOR SEQ ID NO: 77: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: GGCACGGCTG TCCAAGGA 18 (2) INFORMATION FOR SEQ ID NO: 78: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 AGGCCGCGCT CGGCGCCCTC 20 (2) INFORMATION FOR SEQ ID NO: 79: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: CTTACTTGAA TTCCAAGAGC 20 (2) INFORMATION FOR SEQ ID NO: 80: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: GGGCTGACTT GCATGGACCG GA 22 (2) INFORMATION FOR SEQ ID NO: 81 (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: AGCCAGGACA AG 12 (2) INFORMATION FOR SEQ ID NO: 82: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: ACAGCAGGAA CAGCA 15 (2) INFORMATION FOR SEQ ID NO: 83: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: GCGGACATGG AGGACGTG 18 (2) INFORMATION FOR SEQ ID NO: 84: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: GATGCCGATG ACCTGCAGAA G 21 (2) INFORMATION FOR SEQ ID NO: 85: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: GTGCCCTGCA GCTTCACTGA AGAC 24 (2) INFORMATION FOR SEQ ID NO: 86: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: AGCCAGGACA AG 12 (2) INFORMATION FOR SEQ ID NO: 87: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 14 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: AGCCAGGACA AGTC 14 (2) INFORMATION FOR SEQ ID NO: 88: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: AGCCAGGACA AGA 13 (2) INFORMATION FOR SEQ ID NO: 89: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: ACAGCACCAA CAGCA 15 (2) INFORMATION FOR SEQ ID NO: 90: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: ACAGCAGGAA CAGCATC 17 (2) INFORMATION FOR SEQ ID NO: 91: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: ACAGCAGGAA CAGCAG 16 (2) INFORMATION FOR SEQ ID NO: 92: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: GCGGACATGG AGGACGTG 18 (2) INFORMATION FOR SEQ ID NO: 93: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: GCGGACATGG AGGACGTGGC 20 (2) INFORMATION FOR SEQ ID NO: 94: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: GCGGACATGG AGGACGTGC 19 (2) INFORMATION FOR SEQ ID NO: 95: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: GATGCCGATG ACCTGCAGAA G 21 (2) INFORMATION FOR SEQ ID NO: 96: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: GATGCCGATG ACCTGCAGAA GC 22 (2) INFORMATION FOR SEQ ID NO: 97: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: GATGCCGATG ACCTGCAGAA GTG 23 (2) INFORMATION FOR SEQ ID NO: 98: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: GTGCCCTGCA GCTTCACTGA AGAC 24 (2) INFORMATION FOR SEQ ID NO: 99: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: GTGCCCTGCA GCTTCACTGA AGACTG 26 (2) INFORMATION FOR SEQ ID NO: 100: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: GTGCCCTGCA GCTTCACTGA AGACC 25 (2) INFORMATION FOR SEQ ID NO: 101: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: TATCTGTTCA CTTGTGCCC 19 (2) INFORMATION FOR SEQ ID NO: 102: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: CAGAGGCCTG GGGACCCTG 19 (2) INFORMATION FOR SEQ ID NO: 103: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: ACGACAGGGC TGGTTGCC 18 (2) INFORMATION FOR SEQ ID NO: 104: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: ACTGACAACC ACCCTTAAC 19 (2) INFORMATION FOR SEQ ID NO: 105: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: CTGCTTGCCA CAGGTCTC 18 (2) INFORMATION FOR SEQ ID NO: 106: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: CACAGCAGGC CAGTGTGC 18 (2) INFORMATION FOR SEQ ID NO: 107: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: GGACCTGATT TCCTTACTG 19 (2) INFORMATION FOR SEQ ID NO: 108: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: TGAATCTGAG GCATAACTG 19 (2) INFORMATION FOR SEQ ID NO: 109: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 50 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: TTGCGTACAC ACTGGCCGTC GTTTTACAAC GTCGTGACTG GGAAAACCCT 50 (2) INFORMATION FOR SEQ ID NO: 110: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: GTAAAACGAC GGCCAGTGTG TACGCAA 27 (2) INFORMATION FOR SEQ ID NO: 111: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: TACTGGAAGG CGATCTCAGC AATCAGC 27 (2) INFORMATION FOR SEQ ID NO: 112: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: GGCACGGCTG TCCAAGGAG 19 (2) INFORMATION FOR SEQ ID NO: 113: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: AGGCCGCGCT CGGCGCCCTC 20 (2) INFORMATION FOR SEQ ID NO: 114: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: RNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: GUCACUACAG GUGAGCUCCA 20 (2) INFORMATION FOR SEQ ID NO: 115: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: GAATTCGAGC TCGGTACCCG G 21 (2) INFORMATION FOR SEQ ID NO: 116: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: CCGGGTACCG AGCTCGAATT C 21 (2) INFORMATION FOR SEQ ID NO: 117: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: unknown (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: CCTCTTGGGA ACTGTGTAGT ATT 23 (2) INFORMATION FOR SEQ ID NO: 118: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 112 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: AGGCTGTCTC TCTCCCTCTC TCATACACAC ACACACACAC ACACACACAC ACACACACAC 60 ACACACACAC TCACACTCAC CCACANNNAA ATACTACACA GTTCCCAAGA GG 112 (2) INFORMATION FOR SEQ ID NO: 119: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 49 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: TAATACGACT CACTATAGGG CGAAGGCTGT CTCTCTCCCT CTCTCATAC 49 (2) INFORMATION FOR SEQ ID NO: 120: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 135 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: TAATACGACT CACTATAGGG CGAAGGCTGT CTCTCTCCCT CTCTCATACA CACACACACA 60 CACACACACA CACACACACA CACACACACA CACTCACACT CACCCACANN NAAATACTAC 120 ACAGTTCCCA AGAGG 135 (2) INFORMATION FOR SEQ ID NO: 121: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: AATACTACAC AG 12 (2) INFORMATION FOR SEQ ID NO: 122: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: unknown (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: CTGATGCGTC GGATCATCTT TTTT 24 (2) INFORMATION FOR SEQ ID NO: 123: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: GATGATCCGA CGCATCAGAA TGT 23 (2) INFORMATION FOR SEQ ID NO: 124: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: unknown (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: GATCTAGCTG GGCCGAGCTA GGCCGTTGA 29 (2) INFORMATION FOR SEQ ID NO: 125: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: CTGATGCGTC GGATCATCTT TTTTTTT 27 (2) INFORMATION FOR SEQ ID NO: 126: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: GATGATCCGA CG 12 (2) INFORMATION FOR SEQ ID NO: 127: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: GATGATCCGA CGCAT 15 (2) INFORMATION FOR SEQ ID NO: 128: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: AAAAAAGATG AT 12 (2) INFORMATION FOR SEQ ID NO: 129: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: GATCCGACGC AT 12 (2) INFORMATION FOR SEQ ID NO: 130: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 253 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: GGCACGGCTG TCCAAGGAGC TGCAGGCGGC GCAGGCCCGG CTGGGCGCGG ACATGGAGGA 60 CGTGTGCGGC CGCCTGGTGC AGTACCGCGG CGAGGTGCAG GCCATGCTCG GCCAGAGCAC 120 CGAGGAGCTG CGGGTGCGCC TCGCCTCCCA CCTGCGCAAG CTGCGTAAGC GGCTCCTCCG 180 CGATGCCGAT GACCTGCAGA AGTGCCTGGC AGTGTACCAG GCCGGGGCCC GCGAGGGCGC 240 CGAGCGCGGC CTC 253 (2) INFORMATION FOR SEQ ID NO: 131: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 58 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: GAATTACATT CCCAACCGCG TGGCACAACA ACTGGCGGGC AAACAGTCGT TGCTGATT 58 (2) INFORMATION FOR SEQ ID NO: 132: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 57 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: ACCATTAAAG AAAATATCAT CTTTGGTGTT TCCTATGATG AATATAGAAG CGTCATC 57 (2) INFORMATION FOR SEQ ID NO: 133: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: CTATATTCAT CATAGGAAAC ACCAAAGAT 29 (2) INFORMATION FOR SEQ ID NO: 134: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: CTATATTCAT CATAGGAAAC ACCAAT 26 (2) INFORMATION FOR SEQ ID NO: 135: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: CTATATTCAT CATAGGAAAC ACCAAAGAT 29 (2) INFORMATION FOR SEQ ID NO: 136: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 38 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: CTATATTCAT CATAGGAAAC ACCAAAGATG ATATTTTC 38 (2) INFORMATION FOR SEQ ID NO: 137: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 35 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: CTATATTCAT CATAGGAAAC ACCAATG ATATTTTC 35 (2) INFORMATION FOR SEQ ID NO: 138: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 35 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: CTATATTCAT CATAGGAAAC ACCAAAGATA TTTTC 35 (2) INFORMATION FOR SEQ ID NO: 139: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 31 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: CTATATTCAT CATAGGAAAC ACCAAAGATG C 31 (2) INFORMATION FOR SEQ ID NO: 140: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 200 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: CTTCCACCGC GATGTTGATG ATTATGTGTC TGAATTTGAT GGGGGCAGGC GGCCCCCGTC 60 TGTTTGTCGC GGGTCTGGTG TTGATGGTGG TTTCCTGCCT TGTCACCCTC GACCTGCAGC 120 CCAAGCTTGG GATCCACCAC CATCACCATC ACTAATAATG CATGGGCTGC AGCCAATTGG 180 CACTGGCCGT CGTTTTACAA 200 (2) INFORMATION FOR SEQ ID NO: 141: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 99 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: GTCACCCTCG ACCTGCAGCC CAAGCTTGGG ATCCACCACC ATCACCATCA CTAATAATGC 60 ATGGGCTGCA GCCAATTGGC ACTGGCCGTC GTTTTACAA 99 (2) INFORMATION FOR SEQ ID NO: 142: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: TGTACGTCAC AACTA 15 (2) INFORMATION FOR SEQ ID NO: 143: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: TGTACGTCAC AACTAC 16 (2) INFORMATION FOR SEQ ID NO: 144: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: TGTACGTCAC AACTACA 17 (2) INFORMATION FOR SEQ ID NO: 145: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: TGTACGTCAC AACTACAA 18 (2) INFORMATION FOR SEQ ID NO: 146: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: TGTACGTCAC AACTACAAT 19 (2) INFORMATION FOR SEQ ID NO: 147: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: TGTACGTCAC AACTACAATA 20 (2) INFORMATION FOR SEQ ID NO: 148: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: TGTACGTCAC AACTACAATA G 21 (2) INFORMATION FOR SEQ ID NO: 149: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: TGTACGTCAC AACTACAATA GG 22 (2) INFORMATION FOR SEQ ID NO: 150: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: TGTACGTCAC AACTACAATA GGC 23 (2) INFORMATION FOR SEQ ID NO: 151: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: TGTACGTCAC AACTACAATA GGCC 24 (2) INFORMATION FOR SEQ ID NO: 152: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: TGTACGTCAC AACTACAATA GGCCC 25 (2) INFORMATION FOR SEQ ID NO: 153: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: TGTACGTCAC AACTACAATA GGCCCT 26 (2) INFORMATION FOR SEQ ID NO: 154: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: TGTACGTCAC AACTACAATA GGCCCTG 27 (2) INFORMATION FOR SEQ ID NO: 155: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: TGTACGTCAC AACTACAATA GGCCCTGC 28 (2) INFORMATION FOR SEQ ID NO: 156: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: TGTACGTCAC AACTACAATA GGCCCTGCA 29 (2) INFORMATION FOR SEQ ID NO: 157: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: TGTACGTCAC AACTACAATA GGCCCTGCAC 30 (2) INFORMATION FOR SEQ ID NO: 158: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 31 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: TGTACGTCAC AACTACAATA GGCCCTGCAC C 31 (2) INFORMATION FOR SEQ ID NO: 159: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 32 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: TGTACGTCAC AACTACAATA GGCCCTGCAC CA 32 (2) INFORMATION FOR SEQ ID NO: 160: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: TGTACGTCAC AACTACAATA GGCCCTGCAC CAG 33 (2) INFORMATION FOR SEQ ID NO: 161: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 34 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: TGTACGTCAC AACTACAATA GGCCCTGCAC CAGG 34 (2) INFORMATION FOR SEQ ID NO: 162: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 35 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGC 35 (2) INFORMATION FOR SEQ ID NO: 163: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 36 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCC 36 (2) INFORMATION FOR SEQ ID NO: 164: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 37 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCA 37 (2) INFORMATION FOR SEQ ID NO: 165: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 38 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCAG 38 (2) INFORMATION FOR SEQ ID NO: 166: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: TGTACGTCAC AACTACAATA GGCCCTGCAC CAGGCCAGA 39 (2) INFORMATION FOR SEQ ID NO: 167: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: CTGATGCGTC GGATCATCC 19 (2) INFORMATION FOR SEQ ID NO: 168: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: CTGATGCGTC GGATCATCCA 20 (2) INFORMATION FOR SEQ ID NO: 169: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: CTGATGCGTC GGATCATCCA G 21 (2) INFORMATION FOR SEQ ID NO: 170: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: CTGATGCGTC GGATCATCCA GC 22 (2) INFORMATION FOR SEQ ID NO: 171: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: CTGATGCGTC GGATCATCCA GCA 23 (2) INFORMATION FOR SEQ ID NO: 172: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: CTGATGCGTC GGATCATCCA GCAG 24 (2) INFORMATION FOR SEQ ID NO: 173: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: CTGATGCGTC GGATCATCCA GCAGC 25 (2) INFORMATION FOR SEQ ID NO: 174: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: CTGATGCGTC GGATCATCCA GCAGCA 26 (2) INFORMATION FOR SEQ ID NO: 175: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: CTGATGCGTC GGATCATCCA GCAGCAG 27 (2) INFORMATION FOR SEQ ID NO: 176: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: CTGATGCGTC GGATCATCCA GCAGCAGC 28 (2) INFORMATION FOR SEQ ID NO: 177: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: CTGATGCGTC GGATCATCCA GCAGCAGCA 29 (2) INFORMATION FOR SEQ ID NO: 178: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: CTGATGCGTC GGATCATCCA GCAGCAGCAG 30 (2) INFORMATION FOR SEQ ID NO: 179: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 31 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: CTGATGCGTC GGATCATCCA GCAGCAGCAG C 31 (2) INFORMATION FOR SEQ ID NO: 180: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 32 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: CTGATGCGTC GGATCATCCA GCAGCAGCAG CA 32 (2) INFORMATION FOR SEQ ID NO: 181: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAG 33 (2) INFORMATION FOR SEQ ID NO: 182: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 34 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGC 34 (2) INFORMATION FOR SEQ ID NO: 183: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 35 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCA 35 (2) INFORMATION FOR SEQ ID NO: 184: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 36 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAG 36 (2) INFORMATION FOR SEQ ID NO: 185: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 37 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGC 37 (2) INFORMATION FOR SEQ ID NO: 186: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 38 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCA 38 (2) INFORMATION FOR SEQ ID NO: 187: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAG 39 (2) INFORMATION FOR SEQ ID NO: 188: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC 40 (2) INFORMATION FOR SEQ ID NO: 189: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 41 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC A 41 (2) INFORMATION FOR SEQ ID NO: 190: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 42 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AG 42 (2) INFORMATION FOR SEQ ID NO: 191: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGT 43 (2) INFORMATION FOR SEQ ID NO: 192: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTC 44 (2) INFORMATION FOR SEQ ID NO: 193: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 45 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCA 45 (2) INFORMATION FOR SEQ ID NO: 194: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 46 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCAC 46 (2) INFORMATION FOR SEQ ID NO: 195: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 47 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACG 47 (2) INFORMATION FOR SEQ ID NO: 196: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 48 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGC 48 (2) INFORMATION FOR SEQ ID NO: 197: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 49 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCT 49 (2) INFORMATION FOR SEQ ID NO: 198: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 50 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA 50 (2) INFORMATION FOR SEQ ID NO: 199: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 51 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA A 51 (2) INFORMATION FOR SEQ ID NO: 200: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 52 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA AC 52 (2) INFORMATION FOR SEQ ID NO: 201: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 53 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACC 53 (2) INFORMATION FOR SEQ ID NO: 202: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 54 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCG 54 (2) INFORMATION FOR SEQ ID NO: 203: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 55 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGA 55 (2) INFORMATION FOR SEQ ID NO: 204: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 56 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAA 56 (2) INFORMATION FOR SEQ ID NO: 205: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 57 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAAT 57 (2) INFORMATION FOR SEQ ID NO: 206: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 58 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATC 58 (2) INFORMATION FOR SEQ ID NO: 207: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 59 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCC 59 (2) INFORMATION FOR SEQ ID NO: 208: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 60 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 (2) INFORMATION FOR SEQ ID NO: 209: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 61 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 T 61 (2) INFORMATION FOR SEQ ID NO: 210: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 62 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TG 62 (2) INFORMATION FOR SEQ ID NO: 211: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 63 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGG 63 (2) INFORMATION FOR SEQ ID NO: 212: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 64 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGT 64 (2) INFORMATION FOR SEQ ID NO: 213: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 65 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGTC 65 (2) INFORMATION FOR SEQ ID NO: 214: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGTCA 66 (2) INFORMATION FOR SEQ ID NO: 215: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 67 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGTCAG 67 (2) INFORMATION FOR SEQ ID NO: 216: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGTCAGA 68 (2) INFORMATION FOR SEQ ID NO: 217: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 69 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGTCAGAT 69 (2) INFORMATION FOR SEQ ID NO: 218: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGTCAGATC 70 (2) INFORMATION FOR SEQ ID NO: 219: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 71 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGTCAGATC T 71 (2) INFORMATION FOR SEQ ID NO: 220: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 72 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: CTGATGCGTC GGATCATCCA GCAGCAGCAG CAGCAGCAGC AGTCACGCTA ACCGAATCCC 60 TGGTCAGATC TT 72 (2) INFORMATION FOR SEQ ID NO: 221: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: TGCACCTGAC TCC 13 (2) INFORMATION FOR SEQ ID NO: 222: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 14 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: TGCACCTGAC TCCT 14 (2) INFORMATION FOR SEQ ID NO: 223: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: TGCACCTGAC TCCTG 15 (2) INFORMATION FOR SEQ ID NO: 224: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: TGCACCTGAC TCCTGT 16 (2) INFORMATION FOR SEQ ID NO: 225: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: TGCACCTGAC TCCTGTG 17 (2) INFORMATION FOR SEQ ID NO: 226: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: TGCACCTGAC TCCTGTGG 18 (2) INFORMATION FOR SEQ ID NO: 227: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 19 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: TGCACCTGAC TCCTGTGGA 19 (2) INFORMATION FOR SEQ ID NO: 228: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: TGCACCTGAC TCCTGTGGAG 20 (2) INFORMATION FOR SEQ ID NO: 229: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: TGCACCTGAC TCCTGTGGAG A 21 (2) INFORMATION FOR SEQ ID NO: 230: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: TGCACCTGAC TCCTGTGGAG AA 22 (2) INFORMATION FOR SEQ ID NO: 231: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: TGCACCTGAC TCCTGTGGAG AAG 23 (2) INFORMATION FOR SEQ ID NO: 232: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: TGCACCTGAC TCCTGTGGAG AAGT 24 (2) INFORMATION FOR SEQ ID NO: 233: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: TGCACCTGAC TCCTGTGGAG AAGTC 25 (2) INFORMATION FOR SEQ ID NO: 234: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: TGCACCTGAC TCCTGTGGAG AAGTCT 26 (2) INFORMATION FOR SEQ ID NO: 235: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: TGCACCTGAC TCCTGTGGAG AAGTCTG 27 (2) INFORMATION FOR SEQ ID NO: 236: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 28 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: TGCACCTGAC TCCTGTGGAG AAGTCTGC 28 (2) INFORMATION FOR SEQ ID NO: 237: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 29 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: TGCACCTGAC TCCTGTGGAG AAGTCTGCC 29 (2) INFORMATION FOR SEQ ID NO: 238: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG 30 (2) INFORMATION FOR SEQ ID NO: 239: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 31 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG T 31 (2) INFORMATION FOR SEQ ID NO: 240: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 32 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TT 32 (2) INFORMATION FOR SEQ ID NO: 241: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTA 33 (2) INFORMATION FOR SEQ ID NO: 242: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 34 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTAC 34 (2) INFORMATION FOR SEQ ID NO: 243: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 35 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACT 35 (2) INFORMATION FOR SEQ ID NO: 244: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 36 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTG 36 (2) INFORMATION FOR SEQ ID NO: 245: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 37 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGC 37 (2) INFORMATION FOR SEQ ID NO: 246: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 38 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCC 38 (2) INFORMATION FOR SEQ ID NO: 247: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 39 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCC 39 (2) INFORMATION FOR SEQ ID NO: 248: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 40 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT 40 (2) INFORMATION FOR SEQ ID NO: 249: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 41 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT G 41 (2) INFORMATION FOR SEQ ID NO: 250: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 42 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GT 42 (2) INFORMATION FOR SEQ ID NO: 251: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTG 43 (2) INFORMATION FOR SEQ ID NO: 252: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGG 44 (2) INFORMATION FOR SEQ ID NO: 253: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 45 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGG 45 (2) INFORMATION FOR SEQ ID NO: 254: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 46 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGG 46 (2) INFORMATION FOR SEQ ID NO: 255: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 47 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGC 47 (2) INFORMATION FOR SEQ ID NO: 256: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 48 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCA 48 (2) INFORMATION FOR SEQ ID NO: 257: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 49 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAA 49 (2) INFORMATION FOR SEQ ID NO: 258: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 50 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG 50 (2) INFORMATION FOR SEQ ID NO: 259: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 51 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG G 51 (2) INFORMATION FOR SEQ ID NO: 260: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 52 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: TGCACCTGAC TCCTGTGGAG AAGTCTGCCG TTACTGCCCT GTGGGGCAAG GT 52 (2) INFORMATION FOR SEQ ID NO: 261: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 209 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA TGGTGCACCT 60 GACTCCTGTG GAGAAGTCTG CCGTTACTGC CCTGTGGGGC AAGGTGAACG TGGATGAAGT 120 TGGTGGTGAG GCCCTGGGCA GGTTGGTATC AAGGTTACAA GACAGGTTTA AGGAGACCAA 180 TAGAAACTGG GCATGTGGAG ACAGAGAAG 209 (2) INFORMATION FOR SEQ ID NO: 262: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 88 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: TGAGACTCTG TCTCAAAAAT AAATAAATAA ATAAATAAAT AAATAAATAA ATAAATAAAT 60 AAATAAATAA GTAAAAAAGA AAGAATGC 88 (2) INFORMATION FOR SEQ ID NO: 263: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 53 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: GTGTGTGTGT GTGTGTGTTT TTTTTTAACA GGGATTTGGG GAATTATTTG AGA 53 (2) INFORMATION FOR SEQ ID NO: 264: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 23 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: TTCCCCAAAT CCCTGTTAAA AAC 23 (2) INFORMATION FOR SEQ ID NO: 265: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: TTCCCCAAAT CCCTGTTAAA AAAAC 25 (2) INFORMATION FOR SEQ ID NO: 266: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: TTCCCCAAAT CCCTGTTAAA AAAAAAC 27 (2) INFORMATION FOR SEQ ID NO: 267: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 103 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: GTAAAACGAC CGCCAGTGCC AAGCTTGCAT GCCTGCAGGT CGACTCTAGA GGATCCCCGG 60 GTACCGAGCT CGAATTCGTA ATCATGGTCA TAGCTGTTTC CTG 103 (2) INFORMATION FOR SEQ ID NO: 268: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 78 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: GAGTCAGGTG CGCCATGCCT CAAACAGACA CCATGGTGCA CCTGACTCCT GAGGAGNCTG 60 GGCATGTGGA GACAGAGA 78 (2) INFORMATION FOR SEQ ID NO: 269: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 78 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: TCTCTGTCTC CACATGCCCA GNCTCCTCAG GACTCAGGTG CACATGGTGT CTGTTTGAGG 60 CATGGCGCAC CTGAGCTC 78 (2) INFORMATION FOR SEQ ID NO: 270: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 78 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CGCCATGGTG TCTGTTTGAG 60 GCATGGCGCA CGTGACTC 78 (2) INFORMATION FOR SEQ ID NO: 271: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 82 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CGCCATGGTG TCTGTTTGAG 60 GCATGGCGCA CCTGACTCCT GA 82 (2) INFORMATION FOR SEQ ID NO: 272: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 42 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: TCTCTGTCTC CACATGCCCA GNCTCCTCAG GAGTCAGGTG CG 42 (2) INFORMATION FOR SEQ ID NO: 273: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: CACCTGACTC CTA 13 (2) INFORMATION FOR SEQ ID NO: 274: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: CACCTGACTC CTGGA 15 (2) INFORMATION FOR SEQ ID NO: 275: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 14 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: CACCTGACTC CTGA 14 (2) INFORMATION FOR SEQ ID NO: 276: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: CCATGGTGTC TGTTTGAGGC ATGGCG 26 (2) INFORMATION FOR SEQ ID NO: 277: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 75 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 AGTCATCTTG GGGCT 75 (2) INFORMATION FOR SEQ ID NO: 278: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 61 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: ACCTAGCGTT CAGTTCGACT GAGATAATAC GACTCACTAT AGCAGCTCTC ATTTTCCATA 60 C 61 (2) INFORMATION FOR SEQ ID NO: 279: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 20 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: RNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: GUCACUACAG GUGAGCUCCA 20 (2) INFORMATION FOR SEQ ID NO: 280: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: CTCAGTCCAC GTGGTACCCT GCTG 24 (2) INFORMATION FOR SEQ ID NO: 281: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 85 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA TGGTGCACCT 60 GACTCCTGAG GAGAAGTCTG CCGTT 85 (2) INFORMATION FOR SEQ ID NO: 282: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: ACGGGTCCCG GAGTGGTGTC GC 22 (2) INFORMATION FOR SEQ ID NO: 283: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 76 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 GTATCAAGGT TACAAG 76 (2) INFORMATION FOR SEQ ID NO: 284: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 76 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGATTG 60 GTATCAAGGT TACAAG 76 (2) INFORMATION FOR SEQ ID NO: 285: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 76 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 CTATCAAGGT TACAAG 76 (2) INFORMATION FOR SEQ ID NO: 286: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 76 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: ACTGCCCTGT GGGGCAAGGT GAACGTGGAT GAAGTTGGTG GTGAGGCCCT GGGCAGGTTG 60 GCATCAAGGT TACAAG 76 (2) INFORMATION FOR SEQ ID NO: 287: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 48 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: ACAGGTTTAA GGAGACCAAT AGAAACTGGG CATGTGGAGA CAGAGAAG 48 (2) INFORMATION FOR SEQ ID NO: 288: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: GACGACGACT GCTACCTGAC TCCA 24 (2) INFORMATION FOR SEQ ID NO: 289: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: ACAGCGCACT GCTACCTGAC TCCA 24 (2) INFORMATION FOR SEQ ID NO: 290: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: TGGAGTCAGG TAGCAGTC 18 (2) INFORMATION FOR SEQ ID NO: 291: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 60 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 (2) INFORMATION FOR SEQ ID NO: 292: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 65 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: AGTCATCTTG GGGCTGTCGA GAGTAAAAGG TATGTCAGTC ATAGTTAAGA CCTTCTTAAA 60 GGTCT 65 (2) INFORMATION FOR SEQ ID NO: 293: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 25 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: GTAATTTCTA TCAGTAGAAC CCCGA 25 (2) INFORMATION FOR SEQ ID NO: 294: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 60 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 (2) INFORMATION FOR SEQ ID NO: 295: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: AGTCATCTTG GGGCT 15 (2) INFORMATION FOR SEQ ID NO: 296: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 60 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: CAGCTCTCAT TTTCCATACA GTCAGTATCA ATTCTGGAAG AATTTCCAGA CATTAAAGAT 60 (2) INFORMATION FOR SEQ ID NO: 297: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 16 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297: AGTCATCTTG GGGCTA 16 (2) INFORMATION FOR SEQ ID NO: 298: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: CAGCTCTCAT TTTCCATACA TTAAAGATAG TCATCTTGGG GCT 43 (2) INFORMATION FOR SEQ ID NO: 299: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: CAGCTCTCAT TTTCCATACA TTAAAGATAG TCATCTTGGG GCTA 44 (2) INFORMATION FOR SEQ ID NO: 300: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: CAGCTCTCAT TTTCCATACA GT 22 (2) INFORMATION FOR SEQ ID NO: 301: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: CAGCTCTCAT TTTCCATACA T 21 (2) INFORMATION FOR SEQ ID NO: 302: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 50 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: GCCTGGTACA CTGCCAGGCG CTTCTGCAGG TCATCGGCAT CGCGGAGGAG 50 (2) INFORMATION FOR SEQ ID NO: 303: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 50 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: GCCTGGTACA CTGCCAGGCA CTTCTGCAGG TCATCGGCAT CGCGGAGGAG 50 (2) INFORMATION FOR SEQ ID NO: 304: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: GATGCCGATG ACCTGCAGAA G 21 (2) INFORMATION FOR SEQ ID NO: 305: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: GATGCCGATG ACCTGCAGAA GC 22 (2) INFORMATION FOR SEQ ID NO: 306: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: GATGCCGATG ACCTGCAGAA GTGC 24 (2) INFORMATION FOR SEQ ID NO: 307: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: GATGATCCGA CG 12 (2) INFORMATION FOR SEQ ID NO: 308: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 18 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: CTGATGCGTC GGATCATC 18 (2) INFORMATION FOR SEQ ID NO: 309: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 12 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: GATGATCCGA CG 12 (2) INFORMATION FOR SEQ ID NO: 310: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 35 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: GGCGCGGACA TGGAGGACGT GTGCGGCCGC CTGGT 35 (2) INFORMATION FOR SEQ ID NO: 311: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 34 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: TCCGCGATGC CGATGACCTG CAGAAGCGCC TGGC 34 (2) INFORMATION FOR SEQ ID NO: 312: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: CGGCTGCGAT CACCGTGCGG CACAGCT 27 (2) INFORMATION FOR SEQ ID NO: 313: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: CGGCTGCGAT CACCGTGCGG T 21 (2) INFORMATION FOR SEQ ID NO: 314: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: CGGCTGCGAT CACCGTGCGG AACAGCT 27 (2) INFORMATION FOR SEQ ID NO: 315: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: CGGCTGCGAT CACCGTGCGG CA 22 (2) INFORMATION FOR SEQ ID NO: 316: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: CGGCTGCGAT CACCGTGCGG TA 22 (2) INFORMATION FOR SEQ ID NO: 317: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: CGGCTGCGAT CACCGTGCGG A 21 (2) INFORMATION FOR SEQ ID NO: 318: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 42 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: ATCATCAACT GGAAGATCAG GTCAGGAGCC ACTTGCCANC CT 42 (2) INFORMATION FOR SEQ ID NO: 319: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: ATCATCACAC TGGAAGACTC CAGGTCAGGA GCC 33 (2) INFORMATION FOR SEQ ID NO: 320: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 38 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: single (D) TOPOLOGY: unknown (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: <Unknown> (vi) ORIGINAL SOURCE: (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: ATCCACTACA ACTACATGTG TAACAGTTGG WGCWWGCC 38 

What is claimed is:
 1. A process for determining the sequence of a target nucleic acid molecule comprising the steps of: a) generating at least two nucleic acid fragments from the target nucleic acid; and b) analyzing the at least two fragments by a mass spectrometry form, and thereby determining the sequence of the target nucleic acid molecule, wherein the sequence of the target nucleic acid sequence is indicative of a disease or condition or of the absence of the disease or condition.
 2. A process of claim 1, wherein in step a), an endonuclease is contacted with the target nucleic acid to generate the at least two nucleic acid fragments.
 3. A process of claim 2, wherein the endonuclease is a restriction enzyme that can recognize and cleave at least one restriction site in the target nucleic acid.
 4. A process of claim 2, wherein the target nucleic acid is a deoxyribonucleic acid and the nuclease is a deoxyribonuclease.
 5. A process of claim 2, wherein the target nucleic acid is a ribonucleic acid and the nuclease is a ribonuclease.
 6. A process of claim 5, wherein the ribonuclease is selected from the group consisting of: the G-specific T₁ ribonuclease, the A-specific U₂ ribonuclease, the A/U specific PhyM ribonuclease, the U/C specific ribonuclease A, the C-specific chicken liver ribonuclease and crisavitin.
 7. The process of claim 1, wherein in step a), nucleic acid fragments are generated by performance of a combined amplification and base-specific termination reaction.
 8. A process of claim 7, wherein the combined amplification and base-specific termination reaction is performed using a first polymerase, which has a relatively low affinity towards at least one chain terminating nucleotide and an a second polymerase, which has a relatively high affinity towards at least one chain terminating nucleotide.
 9. A process of claim 8, wherein the first and second polymerases are thermostable DNA polymerases.
 10. A process of claim 1, wherein the at least two fragments comprise a 3′ tag or the at least two fragments comprise a 5′ tag.
 11. A process of claim 10, wherein tag is biotin or a nucleic acid fragment that binds to a capture nucleic acid sequence that is bound to a solid support.
 12. A process of claim 1, wherein the process additionally comprises the step of: ordering the at least two nucleic acid fragments to determine the sequence of the target nucleic acid.
 13. A process for detecting a target nucleic acid present in a biological sample, comprising the steps of: a) performing a first polymerase chain reaction on a nucleic acid obtained from a biological sample, using a first set of primers that are capable of amplifying a portion of the nucleic acid containing the target nucleic acid, thereby producing a first amplification product; and b) detecting the first amplification product by mass spectrometry, wherein detection of the target nucleic acid indicates that the target nucleic acid is present in the biological sample.
 14. A process of claim 13, wherein prior to step b), a second polymerase chain reaction is performed on the first amplification product using a second set of primers, which are capable of amplifying at least a portion of the first amplification product, which contains the target nucleic acid.
 15. A process of claim 13, wherein prior to step b), the target nucleic acid is immobilized on a solid support.
 16. A process of claim 15, wherein the target nucleic acid is reversibly immobilized.
 17. A process of claim 16, wherein the target nucleic acid can be cleaved from the solid support by a chemical, enzymatic or physical process.
 18. A process of claim 17, wherein immobilization is effected via a photocleavable bond.
 19. A process of claim 16, wherein the target nucleic acid is cleaved from the support during step b).
 20. A process of claim 15, wherein the solid support is selected from the group consisting of: beads, flat surfaces, chips, capillaries, pins, combs and wafers.
 21. A process of claim 15, wherein immobilization is accomplished by hybridization between a complementary capture nucleic acid molecule immobilized to a solid support, and a portion of the nucleic acid molecule, which is distinct from the target nucleic acid sequence.
 22. A process of claim 13, wherein prior to step b), the target nucleic acid is purified.
 23. A process of claim 13, wherein the primer or first or second amplification product is conditioned.
 24. A process of claim 23, wherein the primer or first or second amplification product is conditioned by phosphodiester backbone modification.
 25. A process of claim 24, wherein the phosphodiester backbone modification is a cation exchange.
 26. A process of claim 23, wherein the primer or first or second amplification product is conditioned by contact with an alkylating agent or trialkylsilyl chloride.
 27. A process of claim 23, wherein conditioning is effected by including at least one nucleotide that reduces sensitivity for depurination in the primer or first or second amplification product.
 28. A process of claim 27, wherein the nucleotide is an N7- or N9-deazapurine nucleotide or 2′ fluoro 2′ deoxy nucleotide.
 29. A method for detecting neoplasia/malagnancies in a tissue or cell sample, comprising detecting telomerase activity, mutation of a proto-oncogene, expression of a tumor specific gene in the sample by detecting nucleic acids that encode the telomerase, that are specific for the mutation or that encode the tumor-specific by mass spectometry.
 30. The method of claim 29 that is a method for detecting neoplasia/malagnancies in a tissue or cell sample, comprising: a) isolating telomerase from the sample and adding a synthetic DNA primer, which is optionally immobilized, complementary to a telomeric repeat, and all four deoxynucleotide triphosphates under conditions that result in telomerase specific extension of the synthetic DNA; b) amplifying the telomerase extended DNA product; and c) detecting the DNA product by mass spectrometry, wherein telomerase-specific extension is indicative of neoplaisa/malignancy.
 31. The method of claim 30, wherein the primer contains a linker moiety for immobilization on a support; and the amplified primers are isolated conjugating the linker portion to a solid support.
 32. The method of claim 29 that is a method for identifying transformed cells or tissues, comprising: a) in a cell or tissue sample, amplifying a portion of a proto-oncogene that includes a codon indicative of transformation, wherein one primer comprises a linker moiety for immobilization; c) immobilizing DNA via the linker moiety to a solid support, optionally in the form of an array; d) hybridizing a primer complementary to the proto oncogene sequence that is upstream from the codon e) adding 3dNTPs/1 ddNTP and DNA polymerase and extending the hybridized primer to the next ddNTP location; f) ionizing/volatizing the sample; and g) detecting the mass of the extended DNA, whereby mass indicates the presence of wild-type or mutant alleles. The presence of a mutant allele at the codon is diagnostic for neoplasia.
 33. The method of claim 32, wherein the proto-oncogene is the RET-proto-oncogene.
 34. The method of claim 32 that is a method for detecting expression of a tumor-specific gene, comprising: a) isolating polyA RNA from the sample; c) preparing a cDNA library using reverse transcription; d) amplifing a cDNA product, or portion thereof, of the tumor-specific gene, wherein one oligo primer comprises a linker moiety; e) isolating the amplified product by immobilizing the DNA to a solid support via the linker moiety; f) optionally conditioning the DNA: g) ionizing/volatizing sample and detecting the presence of a DNA peak that is indicative of expression of the gene.
 35. The method of claim 34, wherein the cells are bone marro cells, the gene is the tyrosine hydroxylase gene, and expression of the gene is indicative of neuroblastoma.
 36. A method for directly detecting a double-stranded nucleic acid using matrix-assisted laser desorption/ionization (MALDI)-time-of- flight (TOF) mass spectrometry, comprising: a) isolating a double-stranded DNA fragment from a cell or tissue sample; b) preparing the double-stranded DNA for analysis under conditions that increase the ratio of dsDNA:ssDNA, wherein the conditions include one or all of the following: preparing samples for analysis at reduced temperatures (i.e. 4° C.), and using of higher DNA concentrations in the matrix to drive duplex formation; c) ionizing/volatizing the sample of step b), wherein low acceleration voltage of the ions are used; d) detecting the presence of the double-straned DNA.
 37. A method for comparing DNA samples to discern relatedness or to detect mutations, comprising: a) obtaining biological a plurality of samples; b) amplifying a region of DNA from each sample that contains two or more microsatellite DNA repeat sequences; c) ionizing/volatizing the amplified DNA; d) detecting the presence of the amplified DNA and comparing the molecular weight of the amplified DNA, wherein different sizes are indicative of non-identity between or among the samples.
 38. The method of claim 37, wherein non-identity is indicative of the presence of a mutuation in the DNA in one sample, non- relatedness or non-HLA compatibility between or among the individuals from whom the samples were obtained.
 39. The method of claim 37, wherein a plurality of markers are examined simultaneoulsy.
 40. A method for detecting a target nucleic acid in a sample, comprising: a) amplifying a target nucleic acid sequence using; (i) a first primer, wherein: the 5′-end shares identity to a portion of the target DNA immediately downstream from the targeted codon followed by a sequence that introduces a unique restriction endonuclease site, and the 3′-end primer is self-complementary; and (ii) a second downstream primer that contains a tag; b) immobilizing the double-stranded amplified DNA to a solid support via the linker moiety; c) denaturing the immobilized DNA and isolating the non-immobilized DNA strand; d) annealing the intracomplementary sequences in the 3′-end of the isolated non-immobilzed DNA strand, such that the 3′-end is extendable by a polymerase; f) extending the annealed DNA by adding DNA polymerase, 3 dNTPs/1 ddNTP; g) cleaving the extended double stranded stem loop DNA with the unique restriction endonuclease and removing the cleaved stem loop DNA; i) ionizing/volatizing the extended product; and j) detecting the presence of the extended target nucleic acid, whereby the presence of a DNA fragment of a mass different from wild-type is indicative of a mutation at the target codon(s).
 41. A method for detecting a target nucleic acid in a biological sample using RNA amplification, comprising: amplifying the target nucleic acid using a primer comprising a region complementary to the target sequence and a region that encodes a promoter; synthesizing RNA using an RNA polymerase the recognizes the promoter; detecting the resulting RNA using mass spectrometry.
 42. A primers for mass spectrometric analyses, comprising all or at least about 20, preferably about 16, bases of any of the sequence of nucleotides sequences set forth in SEQ ID NOs. 1-22, 24, 27-38, 41-86, 89, 92, 95, 98, 101-110, 112-123, 126, 128 and 129, wherein the primer is unlabled.
 43. The primers of claim 48, further comprising a mass modifying moiety.
 44. A process for detecting a target nucleic acid sequence present in a biological sample, comprising the steps of: a) obtaining a nucleic acid molecule containing a target ncleic acid sequence from a biological sample; b) immobilizing the target sequence on the support via thiol linkages, whereby the target is present at a sufficient density to detect it using mass spectrometry; c) hybridizing a detector oligonucleotide with the target nucleic acid sequence; d) removing unhybridized detector oligonucleotide; e) ionizing and volatizing the product of step c); and f) detecting the detector oligonucleotide by mass spectrometry, wherein detection of the detector oligonucleotide indicates the presence of the target nucleic acid sequence in the biological sample.
 45. The process of claim 44, wherein the target nucleic acid molecule is amplified prior to immobilization.
 46. The process of claim 44, wherein at least one of the detector oligonucleotide or the target nucleic acid sequence has been conditioned.
 47. A process of claim 44, wherein target nucleic acid is immobilized in the form of an array.
 48. A process of claim 47, wherein the support is a silicon wafer.
 49. A process of claim 45, wherein the target nucleic acid moelcule is amplified by an amplification procedure selected from the group consisting of cloning, transcription, the polymerase chain reaction (PCR), the ligase chain reaction (LCR), and strand displacement amplification (SDA).
 50. A process of any of claims 44, wherein the sample is conditioned by mass differentiating.
 51. The process of claim 50, wherein the sample is conditioned by mass differentiating at least two detector oligonucleotides or oligonucleotide mimetics to detect and distinguish at least two target nucleic acid sequences simultaneously.
 52. A process of claim 50, wherein the mass differentiation is acheived by the introduction of mass modifying functionalities in the base, sugar or phosphate moiety of the detector oligonucleotides.
 53. A process of claim 50, wherein the mass differentiation is achieved by exchange of cations at the phosphodiester bond.
 54. A process of claim 44, wherein the nucleic acid molecule obtained from a biological sample is amplified into RNA using mass modified ribonucleoside triphosphates and DNA dependent RNA polymerase prior to mass spectrometric detection.
 55. The process of claim 1, wherein the target nucleic acid sequence is indicative of a disease or condition selected from the group consisting of a genetic disease, a chromosomal abnormality, a genetic predisposition, a viral infection, a fungal infection and a bacterial infection.
 56. A method of determining a sequence of a nucleic acid, comprising the steps of: (i) obtaining multiple copies of the nucleic acid to be sequenced: (ii) cleaving the multiple copies from a first end to a second end with an exonuclease to sequentially release individual nucleotides; (iii) identifying each of the sequentially released nucleotides by mass spectrometry; and (iv) determining the sequence of the nucleic acid from the identified nucleotides, wherein the nucleic acid is immobilized by covalent attachment to a solid support via at least one sulfur atom.
 57. A method of determining a sequence of a nucleic acid, comprising the steps of: (i) obtaining multiple copies of the nucleic acid to be sequenced; (ii) cleaving the multiple copies from a first end to a second end with an exonuclease to produce multiple sets of nested nucleic acid fragments; (iii) determining the molecular weight value of each one of the sets of nucleic acid fragments by mass spectrometry; and (iv) determining the sequence of the nucleic acid from the molecular weight values of the sets of nucleic acid fragments, wherein the nucleic acid is immobilized by covalent attachment to a solid support via at least one sulfur atom.
 58. The method of claims 57, wherein the exonuclease is selected from the group consisting of snake venom phosphodiesterase, spleen phosphodiesterase, Bal-31 nuclease, E. coli exonuclease I, E. coli exonuclease VII, Mung Bean Nuclease, S1 Nuclease, an exonuclease activity of E. coli DNA polymerase 1, an exonuclease activity of a Klenow fragment of DNA polymerase 1, an exonuclease activity of T4 DNA polymerase, an exonuclease activity of T7 DNA polymerase, an exonuclease activity of Taq DNA polymerase, an exonuclease activity of Pyrococcus species GB-D DNA polymerasepolymerase, E. coli exonuclease III, lambda exonuclease and an exonuclease activity of Thermococcus litoralis DNA polymerase polymerase.
 59. The method of any of claims 57, wherein the nucleic acid comprises mass-modified nucleotides.
 60. The method of claim 59, wherein the mass-modified nucleotides modulate the rate of the exonuclease activity.
 61. The method of claim 59, wherein the sequentially released nucleotides are mass-modified subsequent to exonuclease release and prior to mass spectrometric identification.
 62. The method of claim 61, wherein the sequentially released nucleotides are mass-modified by contact with an alkaline phosphatase.
 63. A primer, comprising all least about 20, preferably about 16, bases of any of the sequence of nucleotides sequences set forth in SEQ ID NOs. 1-22, 24, 27-38, 41-86, 89, 92, 95, 98, 101-110, 112-123, 126, 128 and
 129. 64. The primers of claim 63 that is unlabeled, and optionally includes a mass modifying moiety, which is preferably attached to the 5′end.
 65. The method of claim 1, wherein nucleic acid is immobilized to a solid support via a selectively cleavable linker.
 66. The method of claim 65, wherein the linker is thermocleavable, enzymatically cleavable, photocleavable or chemically cleavable.
 67. The method of claim 65, wherein the linker is a trityl linker.
 68. The method of claim 65, wherein the linker is selected from the group consisting of 1-(2-nitro-5-(3-O-4,4′-dimethoxytritylpropoxy)-phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane and 1- (4-(3-O-4,4′-dimethoxytritylpropoxy)-3-methoxy-6-nitrophenyl)- 1 -0-((2-cyanoethoxy)-diisopropylaminophosphino)ethane.
 69. A photocleaveable linker, comprising a compound of formula:

wherein: R²⁰ is selected from the group consisting of ω-(4,4′-dimethoxytrityloxy)alkyl and ω-hydroxyalkyl; R²¹ is selected from the group consisting of hydrogen, alkyl, aryl, alkoxycarbonyl, aryloxycarbonyl and carboxy; R²² is selected from the group consisting of hydrogen and (dialkylamino) (ω-cyanoalkoxy) P-; t is 0-3; and R⁵⁰ is selected from the group consisting of alkyl, alkoxy, aryl and aryloxy.
 70. The photocleavable linker of claim 69, wherein the linkers are of formula II:

wherein: R²⁰ is selected from the group consisting of ω-(4,4′-dimethoxytrityloxy)alkyl, ω-hydroxyalkyl and alkyl; R²¹ is selected from the group consisting of hydrogen, alkyl, aryl, alkoxycarbonyl, aryloxycarbonyl and carboxy; R²² is selected from the group consisting of hydrogen and (dialkylamino)(ω-cyanoalkoxy)P-; and X²⁰ is selected from the group consisting of hydrogen, alkyl or OR²⁰.
 86. The photocleavable linker of claim 85, wherein: R²⁰ is selected from the group consisting of 3-(4,4′-dimethoxytrityloxy)propyl, 3-hydroxypropyl and methyl; R²¹ is selected from the group consisting of hydrogen, methyl and carboxy; R²² is selected from the group consisting of hydrogen and (diisopropylamino)(2-cyanoethoxy)P-; and X²⁰ is selected from the group consisting of hydrogen, methyl or OR²⁰.
 71. The photocleavable linker of claim 70, wherein: R²⁰ is 3-(4,4′-dimethoxytrityloxy)propyl; R²¹ is methyl; R²² is (diisopropylamino)(2-cyanoethoxy)P-; and X²⁰ is hydrogen.
 72. The photocleavable linker of claim 70, wherein: R²⁰ is methyl; R²¹ is methyl; R²² is (diisopropylamino)(2-cyanoethoxy)P-; and X²⁰ is 3-(4,4′-dimethoxytrityloxy)propoxy.
 73. A photocleavable linker, comprising a compound of formula III:

wherein: R²³ is selected from the group consisting of hydrogen and (dialkylamino)(ω-cyanoalkoxy)P-; R²⁴ is selected from ω-hydroxyalkoxy, ω-(4,4′-dimethoxytrityloxy)alkoxy, ω-hydroxyalkyl and ω-(4,4′-dimethoxytrityloxy)alkyl, and is unsubstituted or substituted on the alkyl or alkoxy chain with one or more alkyl groups; r and s are each independently 0-4; and R⁵⁰ is alkyl, alkoxy, aryl or aryloxy.
 74. The photocleavable linker of claim 73, wherein: R²⁴ is ω-hydroxyalkyl or ω-(4,4′-dimethoxytrityloxy)alkyl, and is substituted on the alkyl chain with a methyl group.
 75. The photocleavable linker of claim 73, wherein: R²³ is selected from the group consisting of hydrogen and (diisopropylamino)(2-cyanoethoxy)P-; and R²⁴ is selected from the group consisting of 3-hydroxypropoxy, 3-(4,4′-dimethoxytrityloxy)propoxy, 4-hydroxybutyl, 3-hydroxy-1-propyl, 1-hydroxy-2-propyl, 3-hydroxy-2-methyl-1-propyl, 2-hydroxyethyl, hydroxymethyl, 4-(4,4′-dimethoxytrityloxy)butyl, 3-(4,4′-dimethoxytrityloxy)-1-propyl, 2-(4,4′-dimethoxytrityloxy)ethyl, 1-(4,4′-dimethoxytrityloxy)-2-propyl, 3-(4,4′-dimethoxytriyloxy)-2-methyl-1-propyl and 4,4′-dimethyoxytrityloxymethyl.
 76. The photocleavable linker of claim 75, wherein r and s are both
 0. 77. The photocleavable linker of claim 76, wherein: R²³ is (diisopropylamino)(2-cyanoethoxy)P-; and R²⁴ is selected from the group consisting of 3-(4,4′-dimethoxy-trityloxy)propoxy, 4-(4,4′-dimethoxytrityloxy)butyl, 3-(4,4′-dimethoxy-trityloxy)propyl, 2-(4,4′-dimethoxytrityloxy) ethyl, 1-(4,4′-dimethoxy-trityloxy)-2-propyl, 3-(4,4′-dimethoxytriyloxy)-2-methyl-1-propyl and 4,4′-dimethyoxytrityloxymethyl.
 78. The photocleavable linker of claim 77, wherein: R²⁴ is 3-(4,4′-dimethoxytrityloxy)propoxy.
 79. The photocleavable linker of claim 69, where in the linker is selected from the group consisting of 1-(2-nitro-5-(3-O-4,4 ′-dimethoxy-tritylpropoxy)phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)-ethane and 1-(4-(3-O-4,4′-dimethoxytritylpropoxy)-3-methoxy-6-nitro-phenyl)-1-O-((2-cyanoethoxy)-diisopropylaminophosphino)ethane. 